Amino acid dipeptide frequency for Hubei myriapoda virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
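A minimal sketch of how such frequencies could be computed is given here. It assumes overlapping windows of two residues, pairs that never span two proteins, and normalization by the total number of pairs; the database's exact procedure is not stated on this page. The function name `dipeptide_permille` and the toy sequences (in one-letter code) are made up for illustration.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein and the
    result is normalized by the total number of pairs found.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the proteome described on this page.
freqs = dipeptide_permille(["MALLA", "MAL"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The frequencies of all observed pairs sum to 1000‰ under this normalization.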

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
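The arithmetic behind the expected value can be sketched as follows. The comparison against the uniform expectation is an assumption: the page does not state which threshold the red/blue marking actually uses, and the helper names `expected_permille` and `classify` are hypothetical. The four example values are taken from the table on this page.

```python
def expected_permille(alphabet_size):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_permille, expected):
    """Label a value relative to the expectation (assumed simple threshold)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"


expected_20 = expected_permille(20)  # 20 standard amino acids: 400 pairs
expected_21 = expected_permille(21)  # with Xaa: 441 pairs

# Per mille to fraction: multiply by 10^-3 (2.5 per mille = 0.25%).
expected_fraction = expected_20 * 1e-3

examples = {"LeuLeu": 10.484, "AlaAla": 5.242, "CysCys": 0.617, "TrpTrp": 0.0}
labels = {pair: classify(value, expected_20) for pair, value in examples.items()}
print(expected_20, round(expected_21, 3), labels)
```

With the 400-pair expectation, LeuLeu and AlaAla fall above the threshold, while CysCys and TrpTrp fall below it.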

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency (‰) ± error, grouped by first residue. Order of the second residue: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.242 ± 1.298
AlaCys: 1.542 ± 0.497
AlaAsp: 3.7 ± 1.34
AlaGlu: 3.7 ± 1.153
AlaPhe: 3.084 ± 1.621
AlaGly: 2.467 ± 0.739
AlaHis: 1.85 ± 0.811
AlaIle: 4.934 ± 1.302
AlaLys: 2.775 ± 0.189
AlaLeu: 7.092 ± 2.53
AlaMet: 1.542 ± 0.182
AlaAsn: 2.467 ± 1.168
AlaPro: 3.7 ± 1.204
AlaGln: 1.85 ± 0.577
AlaArg: 1.85 ± 1.014
AlaSer: 7.401 ± 1.942
AlaThr: 4.009 ± 1.585
AlaVal: 6.167 ± 0.987
AlaTrp: 0.617 ± 0.453
AlaTyr: 3.084 ± 1.286
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.925 ± 0.486
CysCys: 0.617 ± 0.338
CysAsp: 1.542 ± 0.182
CysGlu: 0.0 ± 0.0
CysPhe: 1.233 ± 0.425
CysGly: 1.233 ± 0.425
CysHis: 0.617 ± 0.295
CysIle: 0.925 ± 0.406
CysLys: 0.617 ± 0.338
CysLeu: 0.617 ± 0.338
CysMet: 0.308 ± 0.169
CysAsn: 1.233 ± 0.369
CysPro: 2.158 ± 0.811
CysGln: 0.0 ± 0.0
CysArg: 1.233 ± 0.676
CysSer: 0.617 ± 0.651
CysThr: 0.925 ± 0.507
CysVal: 2.158 ± 1.283
CysTrp: 0.925 ± 0.288
CysTyr: 0.308 ± 0.551
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.934 ± 1.304
AspCys: 1.542 ± 0.497
AspAsp: 4.934 ± 0.79
AspGlu: 2.467 ± 1.352
AspPhe: 4.934 ± 0.79
AspGly: 2.775 ± 0.529
AspHis: 1.542 ± 0.504
AspIle: 4.009 ± 1.702
AspLys: 2.467 ± 0.739
AspLeu: 6.167 ± 0.786
AspMet: 0.617 ± 0.453
AspAsn: 1.85 ± 0.811
AspPro: 3.392 ± 1.429
AspGln: 3.392 ± 1.452
AspArg: 1.542 ± 0.953
AspSer: 3.7 ± 0.754
AspThr: 2.775 ± 0.867
AspVal: 4.934 ± 0.329
AspTrp: 0.925 ± 0.288
AspTyr: 3.084 ± 0.922
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.392 ± 1.005
GluCys: 0.308 ± 0.169
GluAsp: 3.392 ± 0.956
GluGlu: 2.775 ± 1.175
GluPhe: 2.467 ± 0.851
GluGly: 2.158 ± 0.799
GluHis: 1.233 ± 0.676
GluIle: 3.084 ± 1.116
GluLys: 3.7 ± 1.538
GluLeu: 7.709 ± 1.017
GluMet: 1.233 ± 0.425
GluAsn: 2.775 ± 0.54
GluPro: 2.467 ± 0.555
GluGln: 0.925 ± 0.507
GluArg: 2.775 ± 1.058
GluSer: 3.7 ± 1.108
GluThr: 2.158 ± 0.641
GluVal: 2.467 ± 0.555
GluTrp: 1.233 ± 0.996
GluTyr: 2.158 ± 0.641
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.158 ± 0.811
PheCys: 1.233 ± 0.326
PheAsp: 3.392 ± 0.474
PheGlu: 1.542 ± 0.497
PhePhe: 2.158 ± 1.283
PheGly: 3.084 ± 1.009
PheHis: 2.467 ± 0.165
PheIle: 1.85 ± 1.404
PheLys: 2.158 ± 1.183
PheLeu: 4.625 ± 1.046
PheMet: 1.85 ± 0.426
PheAsn: 0.925 ± 0.406
PhePro: 3.084 ± 2.272
PheGln: 2.775 ± 0.529
PheArg: 2.467 ± 1.244
PheSer: 3.7 ± 0.261
PheThr: 2.775 ± 1.217
PheVal: 4.009 ± 1.354
PheTrp: 2.467 ± 1.244
PheTyr: 1.542 ± 1.137
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.009 ± 0.859
GlyCys: 1.233 ± 0.326
GlyAsp: 2.467 ± 0.165
GlyGlu: 2.775 ± 0.5
GlyPhe: 3.7 ± 0.934
GlyGly: 1.233 ± 0.59
GlyHis: 0.925 ± 0.507
GlyIle: 3.7 ± 1.058
GlyLys: 3.084 ± 0.923
GlyLeu: 3.392 ± 0.826
GlyMet: 1.85 ± 0.343
GlyAsn: 3.392 ± 1.122
GlyPro: 2.775 ± 0.865
GlyGln: 1.85 ± 0.644
GlyArg: 2.775 ± 0.189
GlySer: 2.467 ± 0.395
GlyThr: 4.934 ± 0.99
GlyVal: 4.009 ± 1.784
GlyTrp: 0.925 ± 0.288
GlyTyr: 1.542 ± 0.558
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.775 ± 0.529
HisCys: 0.925 ± 0.406
HisAsp: 1.542 ± 0.182
HisGlu: 1.233 ± 0.906
HisPhe: 0.308 ± 0.551
HisGly: 1.233 ± 0.326
HisHis: 1.233 ± 0.906
HisIle: 0.617 ± 0.295
HisLys: 2.158 ± 0.757
HisLeu: 3.392 ± 0.956
HisMet: 0.925 ± 0.486
HisAsn: 0.308 ± 0.385
HisPro: 3.084 ± 1.039
HisGln: 0.925 ± 0.406
HisArg: 0.925 ± 0.288
HisSer: 1.85 ± 0.426
HisThr: 0.925 ± 1.143
HisVal: 1.85 ± 1.014
HisTrp: 0.0 ± 0.0
HisTyr: 0.617 ± 0.338
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.242 ± 1.189
IleCys: 0.925 ± 0.507
IleAsp: 4.625 ± 0.632
IleGlu: 6.167 ± 0.992
IlePhe: 3.7 ± 1.276
IleGly: 2.775 ± 1.058
IleHis: 0.617 ± 0.453
IleIle: 2.467 ± 0.656
IleLys: 2.467 ± 0.165
IleLeu: 6.784 ± 0.691
IleMet: 1.542 ± 0.497
IleAsn: 1.85 ± 0.62
IlePro: 4.009 ± 0.696
IleGln: 1.233 ± 0.326
IleArg: 3.7 ± 0.47
IleSer: 2.158 ± 0.275
IleThr: 3.084 ± 0.459
IleVal: 4.625 ± 1.046
IleTrp: 0.308 ± 0.169
IleTyr: 1.233 ± 0.369
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.542 ± 0.845
LysCys: 0.925 ± 0.406
LysAsp: 3.392 ± 1.137
LysGlu: 4.317 ± 1.598
LysPhe: 1.85 ± 0.62
LysGly: 1.542 ± 0.182
LysHis: 2.158 ± 0.628
LysIle: 4.625 ± 0.546
LysLys: 0.925 ± 0.486
LysLeu: 4.317 ± 0.997
LysMet: 2.467 ± 0.765
LysAsn: 2.158 ± 0.799
LysPro: 2.467 ± 0.656
LysGln: 1.233 ± 0.425
LysArg: 2.467 ± 0.959
LysSer: 2.467 ± 0.165
LysThr: 3.392 ± 1.137
LysVal: 3.392 ± 0.433
LysTrp: 0.617 ± 0.295
LysTyr: 2.158 ± 0.628
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.475 ± 2.096
LeuCys: 1.233 ± 1.542
LeuAsp: 5.859 ± 1.96
LeuGlu: 4.625 ± 0.632
LeuPhe: 4.625 ± 3.25
LeuGly: 6.475 ± 1.296
LeuHis: 2.158 ± 0.814
LeuIle: 5.55 ± 1.043
LeuLys: 7.092 ± 0.202
LeuLeu: 10.484 ± 1.308
LeuMet: 2.775 ± 1.227
LeuAsn: 5.859 ± 0.588
LeuPro: 3.084 ± 0.459
LeuGln: 4.009 ± 0.72
LeuArg: 7.709 ± 0.865
LeuSer: 8.326 ± 3.526
LeuThr: 3.7 ± 0.636
LeuVal: 4.625 ± 0.756
LeuTrp: 0.925 ± 0.507
LeuTyr: 3.392 ± 0.268
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.775 ± 0.5
MetCys: 0.308 ± 0.169
MetAsp: 1.542 ± 0.844
MetGlu: 0.617 ± 0.338
MetPhe: 0.925 ± 0.507
MetGly: 0.925 ± 0.486
MetHis: 0.0 ± 0.0
MetIle: 0.925 ± 0.288
MetLys: 2.158 ± 0.799
MetLeu: 1.85 ± 0.577
MetMet: 0.617 ± 0.338
MetAsn: 1.85 ± 1.014
MetPro: 0.617 ± 0.295
MetGln: 0.925 ± 0.486
MetArg: 2.158 ± 0.757
MetSer: 1.85 ± 0.62
MetThr: 2.158 ± 1.468
MetVal: 0.617 ± 0.338
MetTrp: 0.925 ± 0.664
MetTyr: 1.233 ± 0.326
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.625 ± 1.442
AsnCys: 0.0 ± 0.0
AsnAsp: 1.85 ± 0.577
AsnGlu: 1.85 ± 0.426
AsnPhe: 3.392 ± 0.914
AsnGly: 2.467 ± 1.352
AsnHis: 0.617 ± 0.453
AsnIle: 3.084 ± 1.947
AsnLys: 1.542 ± 0.497
AsnLeu: 6.167 ± 0.098
AsnMet: 0.925 ± 0.486
AsnAsn: 1.233 ± 0.676
AsnPro: 2.467 ± 0.851
AsnGln: 1.542 ± 0.558
AsnArg: 1.542 ± 0.845
AsnSer: 2.775 ± 0.859
AsnThr: 1.542 ± 0.497
AsnVal: 3.084 ± 0.922
AsnTrp: 0.617 ± 0.338
AsnTyr: 0.617 ± 0.338
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.167 ± 2.345
ProCys: 0.0 ± 0.0
ProAsp: 4.009 ± 1.21
ProGlu: 3.392 ± 0.474
ProPhe: 3.084 ± 0.724
ProGly: 4.317 ± 0.679
ProHis: 1.233 ± 0.425
ProIle: 3.084 ± 0.698
ProLys: 1.85 ± 0.426
ProLeu: 5.55 ± 1.611
ProMet: 0.617 ± 0.338
ProAsn: 2.467 ± 0.652
ProPro: 3.084 ± 1.671
ProGln: 2.467 ± 0.395
ProArg: 2.158 ± 0.757
ProSer: 2.775 ± 1.459
ProThr: 3.392 ± 1.808
ProVal: 4.009 ± 1.585
ProTrp: 0.308 ± 0.169
ProTyr: 0.308 ± 0.169
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.308 ± 0.169
GlnCys: 0.308 ± 0.551
GlnAsp: 1.85 ± 0.644
GlnGlu: 1.542 ± 0.497
GlnPhe: 1.233 ± 0.326
GlnGly: 1.85 ± 0.577
GlnHis: 1.233 ± 0.369
GlnIle: 2.158 ± 0.79
GlnLys: 1.542 ± 0.497
GlnLeu: 4.625 ± 1.34
GlnMet: 0.308 ± 0.169
GlnAsn: 2.775 ± 0.189
GlnPro: 1.542 ± 0.182
GlnGln: 1.233 ± 0.425
GlnArg: 1.233 ± 0.425
GlnSer: 2.775 ± 1.058
GlnThr: 1.85 ± 0.577
GlnVal: 4.009 ± 0.801
GlnTrp: 0.308 ± 0.169
GlnTyr: 1.542 ± 0.182
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.467 ± 1.18
ArgCys: 0.617 ± 0.295
ArgAsp: 3.7 ± 1.538
ArgGlu: 3.7 ± 0.47
ArgPhe: 1.85 ± 0.577
ArgGly: 2.775 ± 0.865
ArgHis: 1.233 ± 0.996
ArgIle: 4.625 ± 1.144
ArgLys: 2.467 ± 0.165
ArgLeu: 4.009 ± 0.943
ArgMet: 1.542 ± 0.845
ArgAsn: 1.542 ± 0.845
ArgPro: 3.084 ± 1.286
ArgGln: 2.158 ± 0.757
ArgArg: 4.009 ± 0.943
ArgSer: 4.934 ± 0.317
ArgThr: 3.392 ± 0.268
ArgVal: 4.625 ± 0.15
ArgTrp: 0.308 ± 0.551
ArgTyr: 1.542 ± 0.586
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.167 ± 1.561
SerCys: 0.925 ± 0.507
SerAsp: 6.167 ± 1.099
SerGlu: 5.55 ± 0.392
SerPhe: 4.009 ± 0.943
SerGly: 4.009 ± 1.207
SerHis: 2.775 ± 0.918
SerIle: 4.934 ± 0.962
SerLys: 1.542 ± 0.504
SerLeu: 8.017 ± 0.796
SerMet: 1.542 ± 0.497
SerAsn: 2.158 ± 1.283
SerPro: 2.467 ± 0.165
SerGln: 1.542 ± 1.089
SerArg: 5.242 ± 0.501
SerSer: 4.625 ± 1.442
SerThr: 4.317 ± 1.049
SerVal: 5.55 ± 3.163
SerTrp: 0.617 ± 0.338
SerTyr: 1.85 ± 0.426
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.392 ± 1.136
ThrCys: 1.85 ± 0.811
ThrAsp: 2.158 ± 0.799
ThrGlu: 1.542 ± 0.182
ThrPhe: 1.85 ± 0.973
ThrGly: 4.934 ± 1.98
ThrHis: 1.542 ± 1.137
ThrIle: 3.084 ± 1.039
ThrLys: 2.467 ± 1.036
ThrLeu: 3.392 ± 1.831
ThrMet: 0.925 ± 0.994
ThrAsn: 2.158 ± 0.241
ThrPro: 3.7 ± 0.361
ThrGln: 0.925 ± 0.288
ThrArg: 2.467 ± 0.555
ThrSer: 7.092 ± 0.531
ThrThr: 4.625 ± 0.656
ThrVal: 4.625 ± 1.442
ThrTrp: 2.158 ± 1.306
ThrTyr: 1.85 ± 0.644
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.242 ± 1.485
ValCys: 2.467 ± 1.036
ValAsp: 1.85 ± 0.644
ValGlu: 2.775 ± 0.558
ValPhe: 4.317 ± 1.355
ValGly: 4.317 ± 0.036
ValHis: 2.775 ± 1.459
ValIle: 2.775 ± 0.859
ValLys: 3.7 ± 0.636
ValLeu: 6.475 ± 1.161
ValMet: 2.467 ± 0.959
ValAsn: 2.467 ± 0.838
ValPro: 5.55 ± 0.778
ValGln: 4.009 ± 0.801
ValArg: 6.167 ± 1.847
ValSer: 5.859 ± 1.433
ValThr: 3.7 ± 1.34
ValVal: 5.55 ± 0.778
ValTrp: 0.617 ± 0.295
ValTyr: 1.233 ± 0.425
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.617 ± 0.295
TrpCys: 0.308 ± 0.385
TrpAsp: 0.308 ± 0.551
TrpGlu: 0.308 ± 0.385
TrpPhe: 0.617 ± 0.295
TrpGly: 0.925 ± 0.288
TrpHis: 0.308 ± 0.385
TrpIle: 1.85 ± 0.131
TrpLys: 1.542 ± 0.504
TrpLeu: 0.925 ± 0.406
TrpMet: 0.617 ± 0.295
TrpAsn: 1.85 ± 1.014
TrpPro: 0.617 ± 0.295
TrpGln: 0.0 ± 0.0
TrpArg: 0.308 ± 0.385
TrpSer: 1.542 ± 0.586
TrpThr: 0.925 ± 0.994
TrpVal: 1.233 ± 0.906
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.617 ± 0.338
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.617 ± 0.338
TyrCys: 0.925 ± 0.507
TyrAsp: 3.7 ± 0.47
TyrGlu: 1.233 ± 0.425
TyrPhe: 0.925 ± 0.406
TyrGly: 1.85 ± 0.936
TyrHis: 0.308 ± 0.385
TyrIle: 1.233 ± 0.676
TyrLys: 2.158 ± 0.757
TyrLeu: 3.392 ± 1.307
TyrMet: 0.0 ± 0.0
TyrAsn: 0.617 ± 0.338
TyrPro: 0.925 ± 0.507
TyrGln: 1.233 ± 0.369
TyrArg: 1.85 ± 0.644
TyrSer: 3.392 ± 0.53
TyrThr: 2.158 ± 1.283
TyrVal: 2.775 ± 0.865
TyrTrp: 0.617 ± 0.453
TyrTyr: 1.233 ± 0.326
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3244 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
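Protein-level bootstrapping can be sketched roughly as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicate values is reported. That the reported error is the standard deviation over replicates is an assumption, as are the overlapping-pair counting, the helper names `pair_permille` and `bootstrap_error`, and the toy sequences (one-letter code, not from this proteome).

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille (overlapping pairs per protein)."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as the proteome has, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome of three made-up sequences.
toy = ["MALLAKLL", "MKVLAAG", "MLLSTTLL"]
err = bootstrap_error(toy, "LL")
print(round(err, 3))
```

With only three proteins, as in this proteome, the replicates differ strongly from one another, which is consistent with the relatively large errors seen in the table.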

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski