Amino acid dipepetide frequency for Streptococcus satellite phage Javan360

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.326AlaAla: 2.326 ± 1.273
0.0AlaCys: 0.0 ± 0.0
2.584AlaAsp: 2.584 ± 0.659
4.91AlaGlu: 4.91 ± 1.748
3.101AlaPhe: 3.101 ± 0.832
1.55AlaGly: 1.55 ± 0.453
0.258AlaHis: 0.258 ± 0.265
3.876AlaIle: 3.876 ± 1.041
6.977AlaLys: 6.977 ± 0.889
3.876AlaLeu: 3.876 ± 0.958
0.775AlaMet: 0.775 ± 0.656
4.393AlaAsn: 4.393 ± 1.009
0.517AlaPro: 0.517 ± 0.359
2.842AlaGln: 2.842 ± 1.166
2.326AlaArg: 2.326 ± 0.537
2.842AlaSer: 2.842 ± 1.047
3.359AlaThr: 3.359 ± 0.69
3.618AlaVal: 3.618 ± 1.131
0.775AlaTrp: 0.775 ± 0.353
2.584AlaTyr: 2.584 ± 0.734
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.258CysAsp: 0.258 ± 0.229
0.517CysGlu: 0.517 ± 0.509
0.0CysPhe: 0.0 ± 0.0
1.034CysGly: 1.034 ± 0.448
0.0CysHis: 0.0 ± 0.0
0.258CysIle: 0.258 ± 0.236
0.517CysLys: 0.517 ± 0.363
0.517CysLeu: 0.517 ± 0.319
0.258CysMet: 0.258 ± 0.278
0.0CysAsn: 0.0 ± 0.0
0.258CysPro: 0.258 ± 0.229
0.258CysGln: 0.258 ± 0.265
0.517CysArg: 0.517 ± 0.317
0.0CysSer: 0.0 ± 0.0
0.258CysThr: 0.258 ± 0.258
0.775CysVal: 0.775 ± 0.434
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
0.775AspAla: 0.775 ± 0.381
0.517AspCys: 0.517 ± 0.342
3.618AspAsp: 3.618 ± 0.904
3.618AspGlu: 3.618 ± 0.637
5.168AspPhe: 5.168 ± 1.084
2.584AspGly: 2.584 ± 0.636
0.258AspHis: 0.258 ± 0.249
5.685AspIle: 5.685 ± 1.239
5.943AspLys: 5.943 ± 1.088
6.202AspLeu: 6.202 ± 1.428
1.55AspMet: 1.55 ± 0.548
4.393AspAsn: 4.393 ± 0.974
0.775AspPro: 0.775 ± 0.614
2.326AspGln: 2.326 ± 1.135
2.067AspArg: 2.067 ± 0.738
3.618AspSer: 3.618 ± 1.027
3.101AspThr: 3.101 ± 1.121
4.651AspVal: 4.651 ± 0.694
0.517AspTrp: 0.517 ± 0.318
1.55AspTyr: 1.55 ± 0.72
0.0AspXaa: 0.0 ± 0.0
Glu
4.91GluAla: 4.91 ± 0.831
0.775GluCys: 0.775 ± 0.387
5.168GluAsp: 5.168 ± 1.2
5.168GluGlu: 5.168 ± 1.017
2.326GluPhe: 2.326 ± 0.701
2.842GluGly: 2.842 ± 0.708
1.034GluHis: 1.034 ± 0.605
7.752GluIle: 7.752 ± 1.27
9.819GluLys: 9.819 ± 1.64
10.078GluLeu: 10.078 ± 1.961
1.292GluMet: 1.292 ± 0.705
4.393GluAsn: 4.393 ± 1.248
1.034GluPro: 1.034 ± 0.463
5.168GluGln: 5.168 ± 1.218
4.393GluArg: 4.393 ± 1.104
5.168GluSer: 5.168 ± 0.76
5.685GluThr: 5.685 ± 1.146
3.101GluVal: 3.101 ± 0.725
0.258GluTrp: 0.258 ± 0.236
3.359GluTyr: 3.359 ± 0.804
0.0GluXaa: 0.0 ± 0.0
Phe
1.809PheAla: 1.809 ± 1.025
0.517PheCys: 0.517 ± 0.313
4.651PheAsp: 4.651 ± 0.806
3.101PheGlu: 3.101 ± 1.117
1.55PhePhe: 1.55 ± 0.712
3.101PheGly: 3.101 ± 0.656
1.034PheHis: 1.034 ± 0.615
3.101PheIle: 3.101 ± 1.022
5.168PheLys: 5.168 ± 1.077
6.718PheLeu: 6.718 ± 0.881
0.258PheMet: 0.258 ± 0.229
1.55PheAsn: 1.55 ± 0.565
0.517PhePro: 0.517 ± 0.472
1.55PheGln: 1.55 ± 0.689
1.034PheArg: 1.034 ± 0.58
4.393PheSer: 4.393 ± 1.27
1.809PheThr: 1.809 ± 0.91
2.842PheVal: 2.842 ± 0.776
0.517PheTrp: 0.517 ± 0.271
1.809PheTyr: 1.809 ± 0.847
0.0PheXaa: 0.0 ± 0.0
Gly
2.067GlyAla: 2.067 ± 0.755
0.517GlyCys: 0.517 ± 0.316
1.55GlyAsp: 1.55 ± 0.857
1.55GlyGlu: 1.55 ± 0.684
3.876GlyPhe: 3.876 ± 0.8
1.809GlyGly: 1.809 ± 0.91
0.258GlyHis: 0.258 ± 0.219
3.101GlyIle: 3.101 ± 0.643
5.426GlyLys: 5.426 ± 1.259
3.618GlyLeu: 3.618 ± 1.013
1.034GlyMet: 1.034 ± 0.532
2.067GlyAsn: 2.067 ± 0.768
0.0GlyPro: 0.0 ± 0.0
1.034GlyGln: 1.034 ± 0.421
1.809GlyArg: 1.809 ± 0.565
3.359GlySer: 3.359 ± 0.957
1.292GlyThr: 1.292 ± 0.505
3.101GlyVal: 3.101 ± 0.648
0.517GlyTrp: 0.517 ± 0.356
3.359GlyTyr: 3.359 ± 1.198
0.0GlyXaa: 0.0 ± 0.0
His
1.292HisAla: 1.292 ± 0.537
0.0HisCys: 0.0 ± 0.0
0.258HisAsp: 0.258 ± 0.241
0.258HisGlu: 0.258 ± 0.268
1.034HisPhe: 1.034 ± 0.536
0.258HisGly: 0.258 ± 0.219
0.517HisHis: 0.517 ± 0.354
0.517HisIle: 0.517 ± 0.438
0.517HisLys: 0.517 ± 0.363
2.326HisLeu: 2.326 ± 0.86
0.0HisMet: 0.0 ± 0.0
0.258HisAsn: 0.258 ± 0.257
0.0HisPro: 0.0 ± 0.0
1.809HisGln: 1.809 ± 0.78
0.775HisArg: 0.775 ± 0.482
1.292HisSer: 1.292 ± 0.566
2.067HisThr: 2.067 ± 0.534
0.258HisVal: 0.258 ± 0.264
0.0HisTrp: 0.0 ± 0.0
0.775HisTyr: 0.775 ± 0.458
0.0HisXaa: 0.0 ± 0.0
Ile
4.393IleAla: 4.393 ± 1.244
0.258IleCys: 0.258 ± 0.249
5.426IleAsp: 5.426 ± 1.052
6.46IleGlu: 6.46 ± 1.706
3.101IlePhe: 3.101 ± 0.938
2.326IleGly: 2.326 ± 1.014
1.809IleHis: 1.809 ± 0.594
4.134IleIle: 4.134 ± 1.222
5.943IleLys: 5.943 ± 1.457
6.46IleLeu: 6.46 ± 0.772
0.775IleMet: 0.775 ± 0.552
4.91IleAsn: 4.91 ± 0.931
3.101IlePro: 3.101 ± 0.82
2.842IleGln: 2.842 ± 0.703
1.809IleArg: 1.809 ± 0.644
5.168IleSer: 5.168 ± 1.315
4.134IleThr: 4.134 ± 1.292
4.134IleVal: 4.134 ± 0.929
0.0IleTrp: 0.0 ± 0.0
1.809IleTyr: 1.809 ± 0.787
0.0IleXaa: 0.0 ± 0.0
Lys
4.91LysAla: 4.91 ± 1.055
0.775LysCys: 0.775 ± 0.435
5.426LysAsp: 5.426 ± 1.227
12.145LysGlu: 12.145 ± 1.867
3.359LysPhe: 3.359 ± 0.891
5.426LysGly: 5.426 ± 1.053
2.326LysHis: 2.326 ± 0.747
5.426LysIle: 5.426 ± 0.88
9.561LysLys: 9.561 ± 1.892
7.752LysLeu: 7.752 ± 1.514
3.359LysMet: 3.359 ± 1.094
4.393LysAsn: 4.393 ± 1.072
2.326LysPro: 2.326 ± 0.652
4.91LysGln: 4.91 ± 1.032
6.718LysArg: 6.718 ± 1.227
7.235LysSer: 7.235 ± 0.955
8.01LysThr: 8.01 ± 1.513
5.685LysVal: 5.685 ± 0.859
0.258LysTrp: 0.258 ± 0.241
2.584LysTyr: 2.584 ± 0.594
0.0LysXaa: 0.0 ± 0.0
Leu
6.718LeuAla: 6.718 ± 1.204
0.258LeuCys: 0.258 ± 0.278
8.786LeuAsp: 8.786 ± 1.35
11.37LeuGlu: 11.37 ± 1.736
4.651LeuPhe: 4.651 ± 0.932
5.685LeuGly: 5.685 ± 0.886
1.034LeuHis: 1.034 ± 0.689
8.01LeuIle: 8.01 ± 2.162
10.078LeuLys: 10.078 ± 1.258
9.044LeuLeu: 9.044 ± 1.437
1.809LeuMet: 1.809 ± 0.576
4.134LeuAsn: 4.134 ± 1.183
2.067LeuPro: 2.067 ± 0.589
5.685LeuGln: 5.685 ± 1.085
4.651LeuArg: 4.651 ± 1.429
7.752LeuSer: 7.752 ± 1.129
5.168LeuThr: 5.168 ± 1.063
5.943LeuVal: 5.943 ± 1.302
0.517LeuTrp: 0.517 ± 0.297
3.101LeuTyr: 3.101 ± 0.709
0.0LeuXaa: 0.0 ± 0.0
Met
1.809MetAla: 1.809 ± 0.706
0.0MetCys: 0.0 ± 0.0
1.809MetAsp: 1.809 ± 0.624
2.584MetGlu: 2.584 ± 1.028
0.775MetPhe: 0.775 ± 0.441
0.258MetGly: 0.258 ± 0.271
0.0MetHis: 0.0 ± 0.0
0.517MetIle: 0.517 ± 0.332
2.326MetLys: 2.326 ± 0.745
1.55MetLeu: 1.55 ± 0.648
0.517MetMet: 0.517 ± 0.341
1.55MetAsn: 1.55 ± 0.709
0.258MetPro: 0.258 ± 0.236
0.775MetGln: 0.775 ± 0.539
2.584MetArg: 2.584 ± 0.727
1.034MetSer: 1.034 ± 0.696
2.326MetThr: 2.326 ± 0.724
0.775MetVal: 0.775 ± 0.452
0.0MetTrp: 0.0 ± 0.0
0.258MetTyr: 0.258 ± 0.257
0.0MetXaa: 0.0 ± 0.0
Asn
3.876AsnAla: 3.876 ± 1.057
0.0AsnCys: 0.0 ± 0.0
2.067AsnAsp: 2.067 ± 0.566
3.359AsnGlu: 3.359 ± 1.086
3.101AsnPhe: 3.101 ± 0.82
2.326AsnGly: 2.326 ± 0.792
1.55AsnHis: 1.55 ± 0.643
2.326AsnIle: 2.326 ± 0.684
5.426AsnLys: 5.426 ± 1.198
6.718AsnLeu: 6.718 ± 1.326
1.034AsnMet: 1.034 ± 0.665
3.101AsnAsn: 3.101 ± 0.693
1.292AsnPro: 1.292 ± 0.561
2.326AsnGln: 2.326 ± 0.753
3.618AsnArg: 3.618 ± 0.879
2.584AsnSer: 2.584 ± 0.477
3.876AsnThr: 3.876 ± 1.489
1.809AsnVal: 1.809 ± 0.967
0.0AsnTrp: 0.0 ± 0.0
2.584AsnTyr: 2.584 ± 0.878
0.0AsnXaa: 0.0 ± 0.0
Pro
0.258ProAla: 0.258 ± 0.219
0.0ProCys: 0.0 ± 0.0
0.517ProAsp: 0.517 ± 0.388
3.101ProGlu: 3.101 ± 0.721
1.034ProPhe: 1.034 ± 0.512
0.517ProGly: 0.517 ± 0.339
0.258ProHis: 0.258 ± 0.237
0.517ProIle: 0.517 ± 0.342
3.101ProLys: 3.101 ± 0.936
2.067ProLeu: 2.067 ± 0.848
0.0ProMet: 0.0 ± 0.0
1.809ProAsn: 1.809 ± 0.697
0.258ProPro: 0.258 ± 0.241
1.034ProGln: 1.034 ± 0.792
1.292ProArg: 1.292 ± 0.674
0.517ProSer: 0.517 ± 0.374
1.55ProThr: 1.55 ± 0.548
0.258ProVal: 0.258 ± 0.229
0.0ProTrp: 0.0 ± 0.0
1.55ProTyr: 1.55 ± 0.543
0.0ProXaa: 0.0 ± 0.0
Gln
5.685GlnAla: 5.685 ± 1.423
0.258GlnCys: 0.258 ± 0.236
1.55GlnAsp: 1.55 ± 0.628
4.651GlnGlu: 4.651 ± 1.032
1.55GlnPhe: 1.55 ± 0.597
1.034GlnGly: 1.034 ± 0.6
0.775GlnHis: 0.775 ± 0.353
3.101GlnIle: 3.101 ± 0.813
4.134GlnLys: 4.134 ± 1.158
5.685GlnLeu: 5.685 ± 0.926
1.292GlnMet: 1.292 ± 0.764
1.809GlnAsn: 1.809 ± 0.688
0.258GlnPro: 0.258 ± 0.241
2.842GlnGln: 2.842 ± 0.744
1.55GlnArg: 1.55 ± 0.804
2.067GlnSer: 2.067 ± 0.564
2.842GlnThr: 2.842 ± 0.737
4.393GlnVal: 4.393 ± 0.636
0.258GlnTrp: 0.258 ± 0.264
1.292GlnTyr: 1.292 ± 0.528
0.0GlnXaa: 0.0 ± 0.0
Arg
2.584ArgAla: 2.584 ± 0.825
0.517ArgCys: 0.517 ± 0.35
2.326ArgAsp: 2.326 ± 0.5
4.134ArgGlu: 4.134 ± 1.048
2.326ArgPhe: 2.326 ± 0.579
1.034ArgGly: 1.034 ± 0.596
0.517ArgHis: 0.517 ± 0.346
4.134ArgIle: 4.134 ± 0.917
5.426ArgLys: 5.426 ± 1.32
5.426ArgLeu: 5.426 ± 1.062
0.775ArgMet: 0.775 ± 0.395
4.134ArgAsn: 4.134 ± 0.788
0.775ArgPro: 0.775 ± 0.407
2.326ArgGln: 2.326 ± 0.661
2.067ArgArg: 2.067 ± 0.63
1.809ArgSer: 1.809 ± 0.573
3.101ArgThr: 3.101 ± 1.023
0.775ArgVal: 0.775 ± 0.55
0.258ArgTrp: 0.258 ± 0.249
3.359ArgTyr: 3.359 ± 1.1
0.0ArgXaa: 0.0 ± 0.0
Ser
2.067SerAla: 2.067 ± 0.811
0.0SerCys: 0.0 ± 0.0
4.393SerAsp: 4.393 ± 1.005
4.134SerGlu: 4.134 ± 0.825
3.618SerPhe: 3.618 ± 1.002
3.618SerGly: 3.618 ± 0.829
1.292SerHis: 1.292 ± 0.503
3.876SerIle: 3.876 ± 1.514
5.685SerLys: 5.685 ± 1.282
7.494SerLeu: 7.494 ± 1.27
2.584SerMet: 2.584 ± 0.73
1.809SerAsn: 1.809 ± 0.582
2.842SerPro: 2.842 ± 0.96
2.326SerGln: 2.326 ± 0.626
2.842SerArg: 2.842 ± 0.824
4.651SerSer: 4.651 ± 1.218
3.876SerThr: 3.876 ± 0.744
3.101SerVal: 3.101 ± 0.891
1.55SerTrp: 1.55 ± 0.753
2.584SerTyr: 2.584 ± 0.72
0.0SerXaa: 0.0 ± 0.0
Thr
2.842ThrAla: 2.842 ± 1.006
0.775ThrCys: 0.775 ± 0.411
2.326ThrAsp: 2.326 ± 0.712
4.651ThrGlu: 4.651 ± 1.071
2.326ThrPhe: 2.326 ± 0.74
4.134ThrGly: 4.134 ± 0.792
0.775ThrHis: 0.775 ± 0.503
5.426ThrIle: 5.426 ± 1.163
5.426ThrLys: 5.426 ± 1.124
9.302ThrLeu: 9.302 ± 1.5
1.292ThrMet: 1.292 ± 0.451
2.842ThrAsn: 2.842 ± 0.908
1.809ThrPro: 1.809 ± 0.692
2.067ThrGln: 2.067 ± 0.693
2.326ThrArg: 2.326 ± 1.379
2.842ThrSer: 2.842 ± 0.883
3.359ThrThr: 3.359 ± 0.932
5.426ThrVal: 5.426 ± 1.25
0.517ThrTrp: 0.517 ± 0.336
0.517ThrTyr: 0.517 ± 0.304
0.0ThrXaa: 0.0 ± 0.0
Val
4.393ValAla: 4.393 ± 1.564
0.258ValCys: 0.258 ± 0.229
4.134ValAsp: 4.134 ± 1.405
3.618ValGlu: 3.618 ± 0.971
1.55ValPhe: 1.55 ± 0.828
1.034ValGly: 1.034 ± 0.443
0.517ValHis: 0.517 ± 0.312
4.134ValIle: 4.134 ± 0.883
5.426ValLys: 5.426 ± 1.177
4.91ValLeu: 4.91 ± 1.283
0.775ValMet: 0.775 ± 0.389
3.359ValAsn: 3.359 ± 0.948
1.034ValPro: 1.034 ± 0.421
2.326ValGln: 2.326 ± 0.636
2.067ValArg: 2.067 ± 0.634
4.651ValSer: 4.651 ± 0.897
3.359ValThr: 3.359 ± 0.803
3.618ValVal: 3.618 ± 0.916
0.0ValTrp: 0.0 ± 0.0
4.651ValTyr: 4.651 ± 0.937
0.0ValXaa: 0.0 ± 0.0
Trp
0.258TrpAla: 0.258 ± 0.229
0.0TrpCys: 0.0 ± 0.0
0.258TrpAsp: 0.258 ± 0.265
1.809TrpGlu: 1.809 ± 0.636
0.0TrpPhe: 0.0 ± 0.0
0.258TrpGly: 0.258 ± 0.249
0.0TrpHis: 0.0 ± 0.0
1.034TrpIle: 1.034 ± 0.504
0.517TrpLys: 0.517 ± 0.349
0.517TrpLeu: 0.517 ± 0.472
0.258TrpMet: 0.258 ± 0.288
0.258TrpAsn: 0.258 ± 0.264
0.0TrpPro: 0.0 ± 0.0
0.258TrpGln: 0.258 ± 0.219
0.517TrpArg: 0.517 ± 0.346
0.258TrpSer: 0.258 ± 0.219
0.258TrpThr: 0.258 ± 0.198
0.517TrpVal: 0.517 ± 0.32
0.258TrpTrp: 0.258 ± 0.219
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.775TyrAla: 0.775 ± 0.488
0.0TyrCys: 0.0 ± 0.0
1.809TyrAsp: 1.809 ± 0.673
2.584TyrGlu: 2.584 ± 0.797
2.584TyrPhe: 2.584 ± 0.842
0.517TyrGly: 0.517 ± 0.304
0.0TyrHis: 0.0 ± 0.0
2.067TyrIle: 2.067 ± 0.628
5.168TyrLys: 5.168 ± 1.372
6.202TyrLeu: 6.202 ± 0.979
1.809TyrMet: 1.809 ± 0.642
1.809TyrAsn: 1.809 ± 0.657
0.517TyrPro: 0.517 ± 0.316
2.326TyrGln: 2.326 ± 0.751
2.842TyrArg: 2.842 ± 1.083
3.101TyrSer: 3.101 ± 1.051
1.55TyrThr: 1.55 ± 0.557
1.034TyrVal: 1.034 ± 0.573
1.034TyrTrp: 1.034 ± 0.492
2.584TyrTyr: 2.584 ± 0.812
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 21 proteins (3871 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski