Amino acid dipeptide frequency for Apis mellifera associated microvirus 49

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
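As an illustration of the idea, here is a minimal sketch of how dipeptide frequencies could be counted from a set of protein sequences. It is not the database's own code: the function name `dipeptide_frequencies`, the toy sequences, and the normalization (overlapping pairs counted within each protein, divided by the total number of pairs) are assumptions made for this example.

```python
from collections import Counter
from itertools import product

# One-letter codes of the 20 standard amino acids; "X" stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumptions (not taken from the source): pairs are counted within each
    protein, so no pair spans two proteins, and every non-standard letter
    is folded into "X".
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2: positions (0,1), (1,2), ...
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy input, not sequences from this proteome.
toy = ["MAAK", "AAG"]
freqs = dipeptide_frequencies(toy)
print(len(freqs))   # 441 ordered pairs (21 x 21)
print(freqs["AA"])  # 2 of the 5 pairs are AlaAla -> 400.0 per mille
```

Pairs that never occur are still reported with a frequency of 0, which mirrors the many 0.0 entries in the table.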

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 amino acid dipeptides would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
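The arithmetic behind the expected frequency and the per mille unit can be sketched as follows. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and the comparison against a uniform 1/400 baseline is an assumption based on the description above; the page's actual colouring rule may differ. The two sample values are taken from the table (AlaAla and AlaCys).

```python
N_STANDARD = 20   # standard amino acids
N_WITH_XAA = 21   # standard amino acids plus Xaa

# Uniform expectation, expressed in per mille like the table values.
expected_400 = 1000.0 / N_STANDARD ** 2   # 2.5 per mille, i.e. 0.25 %
expected_441 = 1000.0 / N_WITH_XAA ** 2   # roughly 2.27 per mille


def per_mille_to_fraction(value):
    """Convert a table value in per mille to a plain fraction."""
    return value * 1e-3


def classify(value_per_mille, expected=expected_400):
    """Label a value relative to the (assumed) uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


print(expected_400)                   # 2.5
print(per_mille_to_fraction(13.081))  # AlaAla as a fraction, about 0.0131
print(classify(13.081))               # over
print(classify(0.727))                # AlaCys -> under
```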

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
13.081AlaAla: 13.081 ± 4.948
0.727AlaCys: 0.727 ± 0.605
2.18AlaAsp: 2.18 ± 1.388
10.174AlaGlu: 10.174 ± 6.097
3.634AlaPhe: 3.634 ± 0.679
8.721AlaGly: 8.721 ± 4.542
1.453AlaHis: 1.453 ± 0.979
3.634AlaIle: 3.634 ± 0.883
2.907AlaLys: 2.907 ± 1.9
9.448AlaLeu: 9.448 ± 3.563
2.18AlaMet: 2.18 ± 0.763
2.18AlaAsn: 2.18 ± 1.004
7.994AlaPro: 7.994 ± 2.007
4.36AlaGln: 4.36 ± 2.166
7.994AlaArg: 7.994 ± 2.06
10.174AlaSer: 10.174 ± 2.556
6.541AlaThr: 6.541 ± 1.973
9.448AlaVal: 9.448 ± 3.46
0.727AlaTrp: 0.727 ± 0.76
3.634AlaTyr: 3.634 ± 0.997
0.0AlaXaa: 0.0 ± 0.0
Cys
1.453CysAla: 1.453 ± 0.631
0.727CysCys: 0.727 ± 0.605
0.727CysAsp: 0.727 ± 0.521
0.0CysGlu: 0.0 ± 0.0
0.727CysPhe: 0.727 ± 0.605
0.727CysGly: 0.727 ± 0.605
0.727CysHis: 0.727 ± 0.605
0.0CysIle: 0.0 ± 0.0
0.727CysLys: 0.727 ± 0.605
1.453CysLeu: 1.453 ± 1.21
0.727CysMet: 0.727 ± 0.789
0.0CysAsn: 0.0 ± 0.0
0.727CysPro: 0.727 ± 0.605
0.0CysGln: 0.0 ± 0.0
1.453CysArg: 1.453 ± 1.21
0.727CysSer: 0.727 ± 0.521
0.0CysThr: 0.0 ± 0.0
0.727CysVal: 0.727 ± 0.877
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
5.814AspAla: 5.814 ± 1.796
0.0AspCys: 0.0 ± 0.0
2.907AspAsp: 2.907 ± 1.958
4.36AspGlu: 4.36 ± 2.863
2.18AspPhe: 2.18 ± 1.348
5.087AspGly: 5.087 ± 2.412
0.0AspHis: 0.0 ± 0.0
1.453AspIle: 1.453 ± 0.834
0.0AspLys: 0.0 ± 0.0
5.087AspLeu: 5.087 ± 1.169
1.453AspMet: 1.453 ± 0.631
2.18AspAsn: 2.18 ± 1.477
3.634AspPro: 3.634 ± 1.147
2.18AspGln: 2.18 ± 0.959
3.634AspArg: 3.634 ± 0.823
2.18AspSer: 2.18 ± 0.936
1.453AspThr: 1.453 ± 0.678
3.634AspVal: 3.634 ± 1.606
2.907AspTrp: 2.907 ± 1.005
1.453AspTyr: 1.453 ± 1.042
0.0AspXaa: 0.0 ± 0.0
Glu
7.267GluAla: 7.267 ± 2.419
0.0GluCys: 0.0 ± 0.0
5.087GluAsp: 5.087 ± 3.231
4.36GluGlu: 4.36 ± 2.696
3.634GluPhe: 3.634 ± 0.823
0.727GluGly: 0.727 ± 0.76
0.727GluHis: 0.727 ± 0.521
2.18GluIle: 2.18 ± 0.77
5.087GluLys: 5.087 ± 2.201
6.541GluLeu: 6.541 ± 2.829
0.0GluMet: 0.0 ± 0.0
2.18GluAsn: 2.18 ± 1.004
1.453GluPro: 1.453 ± 0.979
1.453GluGln: 1.453 ± 0.631
5.087GluArg: 5.087 ± 2.948
0.727GluSer: 0.727 ± 0.605
1.453GluThr: 1.453 ± 0.631
5.087GluVal: 5.087 ± 3.474
0.727GluTrp: 0.727 ± 0.765
4.36GluTyr: 4.36 ± 1.973
0.0GluXaa: 0.0 ± 0.0
Phe
6.541PheAla: 6.541 ± 2.179
0.0PheCys: 0.0 ± 0.0
2.18PheAsp: 2.18 ± 0.77
2.18PheGlu: 2.18 ± 1.564
2.18PhePhe: 2.18 ± 0.763
3.634PheGly: 3.634 ± 1.55
0.727PheHis: 0.727 ± 0.877
0.727PheIle: 0.727 ± 0.521
0.727PheLys: 0.727 ± 0.76
0.727PheLeu: 0.727 ± 0.76
1.453PheMet: 1.453 ± 0.685
0.727PheAsn: 0.727 ± 0.605
1.453PhePro: 1.453 ± 1.042
1.453PheGln: 1.453 ± 1.253
3.634PheArg: 3.634 ± 1.572
0.727PheSer: 0.727 ± 0.521
1.453PheThr: 1.453 ± 0.878
3.634PheVal: 3.634 ± 1.008
0.727PheTrp: 0.727 ± 0.521
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
6.541GlyAla: 6.541 ± 2.292
1.453GlyCys: 1.453 ± 1.21
5.087GlyAsp: 5.087 ± 1.825
5.087GlyGlu: 5.087 ± 2.199
2.18GlyPhe: 2.18 ± 1.353
3.634GlyGly: 3.634 ± 1.198
0.727GlyHis: 0.727 ± 0.521
3.634GlyIle: 3.634 ± 1.214
5.087GlyLys: 5.087 ± 2.232
5.087GlyLeu: 5.087 ± 2.569
1.453GlyMet: 1.453 ± 1.52
3.634GlyAsn: 3.634 ± 1.147
2.907GlyPro: 2.907 ± 1.066
4.36GlyGln: 4.36 ± 0.914
6.541GlyArg: 6.541 ± 3.073
4.36GlySer: 4.36 ± 1.86
5.087GlyThr: 5.087 ± 1.609
5.814GlyVal: 5.814 ± 2.073
0.0GlyTrp: 0.0 ± 0.0
1.453GlyTyr: 1.453 ± 1.042
0.0GlyXaa: 0.0 ± 0.0
His
2.18HisAla: 2.18 ± 0.936
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.727HisPhe: 0.727 ± 0.521
2.18HisGly: 2.18 ± 0.987
0.727HisHis: 0.727 ± 0.521
0.0HisIle: 0.0 ± 0.0
0.727HisLys: 0.727 ± 0.605
1.453HisLeu: 1.453 ± 0.678
0.0HisMet: 0.0 ± 0.0
0.727HisAsn: 0.727 ± 0.765
3.634HisPro: 3.634 ± 1.241
0.0HisGln: 0.0 ± 0.0
1.453HisArg: 1.453 ± 0.631
0.727HisSer: 0.727 ± 0.521
1.453HisThr: 1.453 ± 0.631
1.453HisVal: 1.453 ± 0.979
0.727HisTrp: 0.727 ± 0.521
0.727HisTyr: 0.727 ± 0.605
0.0HisXaa: 0.0 ± 0.0
Ile
3.634IleAla: 3.634 ± 1.967
0.727IleCys: 0.727 ± 0.605
0.727IleAsp: 0.727 ± 0.765
2.907IleGlu: 2.907 ± 1.556
0.727IlePhe: 0.727 ± 0.877
6.541IleGly: 6.541 ± 2.671
2.18IleHis: 2.18 ± 0.632
2.18IleIle: 2.18 ± 1.388
1.453IleLys: 1.453 ± 0.834
2.907IleLeu: 2.907 ± 0.943
0.727IleMet: 0.727 ± 0.578
2.18IleAsn: 2.18 ± 0.987
1.453IlePro: 1.453 ± 0.722
2.18IleGln: 2.18 ± 1.564
4.36IleArg: 4.36 ± 1.672
2.907IleSer: 2.907 ± 1.715
2.907IleThr: 2.907 ± 2.085
1.453IleVal: 1.453 ± 1.753
0.727IleTrp: 0.727 ± 0.521
2.907IleTyr: 2.907 ± 1.446
0.0IleXaa: 0.0 ± 0.0
Lys
6.541LysAla: 6.541 ± 3.002
0.0LysCys: 0.0 ± 0.0
2.907LysAsp: 2.907 ± 1.062
2.18LysGlu: 2.18 ± 0.936
0.727LysPhe: 0.727 ± 0.521
2.18LysGly: 2.18 ± 0.763
0.727LysHis: 0.727 ± 0.521
2.18LysIle: 2.18 ± 1.436
1.453LysLys: 1.453 ± 1.21
0.727LysLeu: 0.727 ± 0.521
2.18LysMet: 2.18 ± 1.815
0.0LysAsn: 0.0 ± 0.0
1.453LysPro: 1.453 ± 1.21
1.453LysGln: 1.453 ± 1.253
5.087LysArg: 5.087 ± 2.201
2.18LysSer: 2.18 ± 0.632
2.907LysThr: 2.907 ± 0.804
2.18LysVal: 2.18 ± 2.63
0.0LysTrp: 0.0 ± 0.0
1.453LysTyr: 1.453 ± 0.722
0.0LysXaa: 0.0 ± 0.0
Leu
10.901LeuAla: 10.901 ± 3.022
0.727LeuCys: 0.727 ± 0.605
1.453LeuAsp: 1.453 ± 1.042
5.814LeuGlu: 5.814 ± 1.352
0.727LeuPhe: 0.727 ± 0.605
8.721LeuGly: 8.721 ± 2.18
0.0LeuHis: 0.0 ± 0.0
5.814LeuIle: 5.814 ± 1.515
0.727LeuLys: 0.727 ± 0.605
4.36LeuLeu: 4.36 ± 1.498
1.453LeuMet: 1.453 ± 0.834
4.36LeuAsn: 4.36 ± 0.776
5.814LeuPro: 5.814 ± 1.655
6.541LeuGln: 6.541 ± 2.164
8.721LeuArg: 8.721 ± 1.48
4.36LeuSer: 4.36 ± 0.776
5.087LeuThr: 5.087 ± 1.175
2.907LeuVal: 2.907 ± 1.427
0.727LeuTrp: 0.727 ± 0.605
2.18LeuTyr: 2.18 ± 1.569
0.0LeuXaa: 0.0 ± 0.0
Met
0.727MetAla: 0.727 ± 0.76
0.0MetCys: 0.0 ± 0.0
1.453MetAsp: 1.453 ± 0.722
2.18MetGlu: 2.18 ± 2.295
0.727MetPhe: 0.727 ± 0.605
2.18MetGly: 2.18 ± 0.959
1.453MetHis: 1.453 ± 0.678
1.453MetIle: 1.453 ± 0.906
0.727MetLys: 0.727 ± 0.521
0.727MetLeu: 0.727 ± 0.521
0.0MetMet: 0.0 ± 0.0
0.727MetAsn: 0.727 ± 0.605
1.453MetPro: 1.453 ± 1.116
2.18MetGln: 2.18 ± 0.632
2.18MetArg: 2.18 ± 0.632
1.453MetSer: 1.453 ± 0.979
0.727MetThr: 0.727 ± 0.605
1.453MetVal: 1.453 ± 1.52
0.727MetTrp: 0.727 ± 0.605
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.18AsnAla: 2.18 ± 1.121
2.18AsnCys: 2.18 ± 1.121
2.18AsnAsp: 2.18 ± 0.987
1.453AsnGlu: 1.453 ± 0.722
1.453AsnPhe: 1.453 ± 1.042
1.453AsnGly: 1.453 ± 0.722
0.0AsnHis: 0.0 ± 0.0
2.18AsnIle: 2.18 ± 1.267
1.453AsnLys: 1.453 ± 0.722
2.18AsnLeu: 2.18 ± 1.004
0.727AsnMet: 0.727 ± 0.76
0.0AsnAsn: 0.0 ± 0.0
5.814AsnPro: 5.814 ± 1.226
0.727AsnGln: 0.727 ± 0.76
2.907AsnArg: 2.907 ± 0.878
2.18AsnSer: 2.18 ± 1.004
1.453AsnThr: 1.453 ± 0.631
0.0AsnVal: 0.0 ± 0.0
0.0AsnTrp: 0.0 ± 0.0
1.453AsnTyr: 1.453 ± 0.722
0.0AsnXaa: 0.0 ± 0.0
Pro
8.721ProAla: 8.721 ± 2.691
0.727ProCys: 0.727 ± 0.605
2.907ProAsp: 2.907 ± 0.761
2.907ProGlu: 2.907 ± 0.613
2.18ProPhe: 2.18 ± 0.763
5.814ProGly: 5.814 ± 3.326
2.18ProHis: 2.18 ± 0.77
2.18ProIle: 2.18 ± 1.348
4.36ProLys: 4.36 ± 1.345
5.814ProLeu: 5.814 ± 2.562
2.18ProMet: 2.18 ± 1.004
1.453ProAsn: 1.453 ± 0.631
5.087ProPro: 5.087 ± 2.513
1.453ProGln: 1.453 ± 0.678
2.907ProArg: 2.907 ± 1.281
3.634ProSer: 3.634 ± 1.092
3.634ProThr: 3.634 ± 0.475
6.541ProVal: 6.541 ± 2.814
0.727ProTrp: 0.727 ± 0.521
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.907GlnAla: 2.907 ± 2.118
1.453GlnCys: 1.453 ± 0.631
3.634GlnAsp: 3.634 ± 1.9
0.727GlnGlu: 0.727 ± 0.76
1.453GlnPhe: 1.453 ± 1.042
0.727GlnGly: 0.727 ± 0.605
0.727GlnHis: 0.727 ± 0.521
2.18GlnIle: 2.18 ± 1.696
2.18GlnLys: 2.18 ± 1.004
2.18GlnLeu: 2.18 ± 0.632
0.727GlnMet: 0.727 ± 0.765
0.727GlnAsn: 0.727 ± 0.521
2.18GlnPro: 2.18 ± 1.436
2.18GlnGln: 2.18 ± 1.388
5.087GlnArg: 5.087 ± 1.862
2.18GlnSer: 2.18 ± 0.936
2.907GlnThr: 2.907 ± 1.427
2.18GlnVal: 2.18 ± 1.034
0.727GlnTrp: 0.727 ± 0.521
0.727GlnTyr: 0.727 ± 0.605
0.0GlnXaa: 0.0 ± 0.0
Arg
7.994ArgAla: 7.994 ± 1.407
0.727ArgCys: 0.727 ± 0.521
5.087ArgAsp: 5.087 ± 2.14
4.36ArgGlu: 4.36 ± 2.554
4.36ArgPhe: 4.36 ± 1.343
2.907ArgGly: 2.907 ± 0.613
1.453ArgHis: 1.453 ± 0.631
4.36ArgIle: 4.36 ± 1.444
2.907ArgLys: 2.907 ± 0.998
10.174ArgLeu: 10.174 ± 2.085
2.18ArgMet: 2.18 ± 1.034
2.18ArgAsn: 2.18 ± 0.632
7.994ArgPro: 7.994 ± 2.371
2.907ArgGln: 2.907 ± 1.444
13.081ArgArg: 13.081 ± 8.69
5.814ArgSer: 5.814 ± 2.157
2.18ArgThr: 2.18 ± 0.632
7.267ArgVal: 7.267 ± 1.897
0.727ArgTrp: 0.727 ± 0.605
4.36ArgTyr: 4.36 ± 1.343
0.0ArgXaa: 0.0 ± 0.0
Ser
5.087SerAla: 5.087 ± 2.136
0.727SerCys: 0.727 ± 0.877
5.814SerAsp: 5.814 ± 2.094
2.907SerGlu: 2.907 ± 0.998
1.453SerPhe: 1.453 ± 1.042
5.087SerGly: 5.087 ± 1.715
0.0SerHis: 0.0 ± 0.0
5.814SerIle: 5.814 ± 2.658
2.907SerLys: 2.907 ± 1.066
5.087SerLeu: 5.087 ± 1.559
2.18SerMet: 2.18 ± 1.769
2.18SerAsn: 2.18 ± 0.632
1.453SerPro: 1.453 ± 0.878
0.727SerGln: 0.727 ± 0.76
5.814SerArg: 5.814 ± 1.885
8.721SerSer: 8.721 ± 4.035
5.087SerThr: 5.087 ± 2.951
2.907SerVal: 2.907 ± 1.355
0.727SerTrp: 0.727 ± 0.521
0.727SerTyr: 0.727 ± 0.76
0.0SerXaa: 0.0 ± 0.0
Thr
6.541ThrAla: 6.541 ± 1.641
0.0ThrCys: 0.0 ± 0.0
2.18ThrAsp: 2.18 ± 0.632
0.0ThrGlu: 0.0 ± 0.0
2.18ThrPhe: 2.18 ± 0.987
5.814ThrGly: 5.814 ± 1.987
2.18ThrHis: 2.18 ± 0.987
1.453ThrIle: 1.453 ± 1.042
2.18ThrLys: 2.18 ± 0.987
8.721ThrLeu: 8.721 ± 2.654
0.0ThrMet: 0.0 ± 0.0
1.453ThrAsn: 1.453 ± 0.878
3.634ThrPro: 3.634 ± 1.199
0.727ThrGln: 0.727 ± 0.76
2.18ThrArg: 2.18 ± 1.243
3.634ThrSer: 3.634 ± 2.606
3.634ThrThr: 3.634 ± 1.9
2.907ThrVal: 2.907 ± 1.446
2.18ThrTrp: 2.18 ± 1.243
0.727ThrTyr: 0.727 ± 0.605
0.0ThrXaa: 0.0 ± 0.0
Val
7.267ValAla: 7.267 ± 1.404
1.453ValCys: 1.453 ± 0.906
2.907ValAsp: 2.907 ± 2.084
3.634ValGlu: 3.634 ± 1.481
2.18ValPhe: 2.18 ± 0.987
2.907ValGly: 2.907 ± 1.207
1.453ValHis: 1.453 ± 0.972
2.907ValIle: 2.907 ± 0.943
2.907ValLys: 2.907 ± 1.556
3.634ValLeu: 3.634 ± 0.823
0.727ValMet: 0.727 ± 0.521
2.18ValAsn: 2.18 ± 1.564
5.814ValPro: 5.814 ± 1.023
0.727ValGln: 0.727 ± 0.765
6.541ValArg: 6.541 ± 0.956
7.994ValSer: 7.994 ± 4.223
2.907ValThr: 2.907 ± 0.695
1.453ValVal: 1.453 ± 0.972
1.453ValTrp: 1.453 ± 0.722
0.727ValTyr: 0.727 ± 0.765
0.0ValXaa: 0.0 ± 0.0
Trp
1.453TrpAla: 1.453 ± 0.722
0.0TrpCys: 0.0 ± 0.0
0.727TrpAsp: 0.727 ± 0.76
1.453TrpGlu: 1.453 ± 1.042
0.727TrpPhe: 0.727 ± 0.521
0.727TrpGly: 0.727 ± 0.605
0.727TrpHis: 0.727 ± 0.521
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
1.453TrpLeu: 1.453 ± 1.21
0.727TrpMet: 0.727 ± 0.707
1.453TrpAsn: 1.453 ± 1.042
0.0TrpPro: 0.0 ± 0.0
1.453TrpGln: 1.453 ± 0.631
1.453TrpArg: 1.453 ± 0.834
1.453TrpSer: 1.453 ± 0.678
0.727TrpThr: 0.727 ± 0.605
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.727TrpTyr: 0.727 ± 0.605
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.36TyrAla: 4.36 ± 1.769
0.0TyrCys: 0.0 ± 0.0
1.453TyrAsp: 1.453 ± 1.042
1.453TyrGlu: 1.453 ± 1.53
0.727TyrPhe: 0.727 ± 0.521
4.36TyrGly: 4.36 ± 1.298
0.727TyrHis: 0.727 ± 0.605
1.453TyrIle: 1.453 ± 0.722
0.0TyrLys: 0.0 ± 0.0
3.634TyrLeu: 3.634 ± 1.198
0.727TyrMet: 0.727 ± 0.521
1.453TyrAsn: 1.453 ± 0.834
1.453TyrPro: 1.453 ± 1.21
0.727TyrGln: 0.727 ± 0.521
2.907TyrArg: 2.907 ± 0.804
0.0TyrSer: 0.0 ± 0.0
0.727TyrThr: 0.727 ± 0.605
0.727TyrVal: 0.727 ± 0.605
0.727TyrTrp: 0.727 ± 0.521
0.727TyrTyr: 0.727 ± 0.521
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1377 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
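A protein-level bootstrap of this kind could look like the sketch below. It rests on several assumptions that the note does not confirm: "x100" is read as 100 resamples, whole proteins are drawn with replacement, and the reported error is taken to be the sample standard deviation across resamples. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the spread (per mille) of one dipeptide frequency.

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement, and the error is the standard deviation over replicates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not the 5 proteins of this proteome.
toy = ["MAAK", "AAG", "MKV"]
print(bootstrap_error(toy, "AA"))  # positive spread, depends on the seed
print(bootstrap_error(toy, "WW"))  # 0.0, the pair never occurs
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so with only a handful of proteins the spread can be large relative to the value itself. A dipeptide that is absent from every protein gets an error of exactly 0, consistent with the "0.0 ± 0.0" entries in the table.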

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski