Amino acid dipepetide frequency for Streptococcus satellite phage Javan97

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.504AlaAla: 1.504 ± 0.616
1.253AlaCys: 1.253 ± 0.569
3.008AlaAsp: 3.008 ± 0.709
4.262AlaGlu: 4.262 ± 1.18
3.008AlaPhe: 3.008 ± 0.621
2.006AlaGly: 2.006 ± 0.658
0.501AlaHis: 0.501 ± 0.338
7.27AlaIle: 7.27 ± 1.393
4.763AlaLys: 4.763 ± 0.988
4.763AlaLeu: 4.763 ± 1.031
1.755AlaMet: 1.755 ± 0.632
4.011AlaAsn: 4.011 ± 0.986
1.253AlaPro: 1.253 ± 0.492
2.256AlaGln: 2.256 ± 0.536
2.507AlaArg: 2.507 ± 0.735
4.512AlaSer: 4.512 ± 1.184
3.008AlaThr: 3.008 ± 0.63
2.758AlaVal: 2.758 ± 1.033
1.003AlaTrp: 1.003 ± 0.63
2.507AlaTyr: 2.507 ± 0.849
0.0AlaXaa: 0.0 ± 0.0
Cys
0.501CysAla: 0.501 ± 0.348
0.0CysCys: 0.0 ± 0.0
0.501CysAsp: 0.501 ± 0.296
0.501CysGlu: 0.501 ± 0.346
0.251CysPhe: 0.251 ± 0.267
0.501CysGly: 0.501 ± 0.38
0.251CysHis: 0.251 ± 0.225
0.0CysIle: 0.0 ± 0.0
0.251CysLys: 0.251 ± 0.235
0.501CysLeu: 0.501 ± 0.34
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.251CysPro: 0.251 ± 0.236
0.752CysGln: 0.752 ± 0.478
0.251CysArg: 0.251 ± 0.236
0.251CysSer: 0.251 ± 0.257
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.501CysTyr: 0.501 ± 0.358
0.0CysXaa: 0.0 ± 0.0
Asp
1.504AspAla: 1.504 ± 0.63
0.752AspCys: 0.752 ± 0.533
4.011AspAsp: 4.011 ± 1.051
4.512AspGlu: 4.512 ± 1.453
2.507AspPhe: 2.507 ± 0.746
2.758AspGly: 2.758 ± 0.87
0.501AspHis: 0.501 ± 0.282
8.022AspIle: 8.022 ± 1.076
4.512AspLys: 4.512 ± 0.887
4.763AspLeu: 4.763 ± 0.669
1.755AspMet: 1.755 ± 0.801
2.507AspAsn: 2.507 ± 0.992
1.003AspPro: 1.003 ± 0.419
2.006AspGln: 2.006 ± 0.846
4.011AspArg: 4.011 ± 0.881
2.256AspSer: 2.256 ± 0.53
3.76AspThr: 3.76 ± 1.05
0.501AspVal: 0.501 ± 0.336
0.251AspTrp: 0.251 ± 0.244
3.51AspTyr: 3.51 ± 1.007
0.0AspXaa: 0.0 ± 0.0
Glu
6.267GluAla: 6.267 ± 1.292
1.003GluCys: 1.003 ± 0.616
4.512GluAsp: 4.512 ± 1.161
5.264GluGlu: 5.264 ± 0.973
2.006GluPhe: 2.006 ± 0.859
3.259GluGly: 3.259 ± 0.731
2.006GluHis: 2.006 ± 0.592
5.766GluIle: 5.766 ± 1.516
5.264GluLys: 5.264 ± 0.932
12.284GluLeu: 12.284 ± 1.574
1.755GluMet: 1.755 ± 0.586
4.011GluAsn: 4.011 ± 1.058
1.003GluPro: 1.003 ± 0.367
4.512GluGln: 4.512 ± 1.448
4.011GluArg: 4.011 ± 0.866
2.758GluSer: 2.758 ± 0.874
2.256GluThr: 2.256 ± 0.548
4.262GluVal: 4.262 ± 0.886
1.003GluTrp: 1.003 ± 0.41
3.51GluTyr: 3.51 ± 0.901
0.0GluXaa: 0.0 ± 0.0
Phe
2.507PheAla: 2.507 ± 0.558
0.0PheCys: 0.0 ± 0.0
3.259PheAsp: 3.259 ± 0.677
3.259PheGlu: 3.259 ± 0.992
1.755PhePhe: 1.755 ± 0.546
1.504PheGly: 1.504 ± 0.43
2.006PheHis: 2.006 ± 0.528
3.008PheIle: 3.008 ± 0.944
6.017PheLys: 6.017 ± 0.916
3.008PheLeu: 3.008 ± 0.775
0.752PheMet: 0.752 ± 0.379
3.259PheAsn: 3.259 ± 0.799
1.253PhePro: 1.253 ± 0.512
1.003PheGln: 1.003 ± 0.649
1.504PheArg: 1.504 ± 0.587
2.758PheSer: 2.758 ± 0.695
2.758PheThr: 2.758 ± 0.703
1.504PheVal: 1.504 ± 0.536
0.501PheTrp: 0.501 ± 0.287
1.755PheTyr: 1.755 ± 0.598
0.0PheXaa: 0.0 ± 0.0
Gly
2.758GlyAla: 2.758 ± 1.047
0.0GlyCys: 0.0 ± 0.0
3.008GlyAsp: 3.008 ± 0.955
2.256GlyGlu: 2.256 ± 0.677
2.006GlyPhe: 2.006 ± 0.747
2.256GlyGly: 2.256 ± 0.726
0.501GlyHis: 0.501 ± 0.47
2.758GlyIle: 2.758 ± 0.881
3.76GlyLys: 3.76 ± 0.967
5.264GlyLeu: 5.264 ± 1.142
1.504GlyMet: 1.504 ± 0.472
3.259GlyAsn: 3.259 ± 0.702
0.251GlyPro: 0.251 ± 0.243
3.259GlyGln: 3.259 ± 1.051
3.259GlyArg: 3.259 ± 0.834
2.758GlySer: 2.758 ± 0.817
3.259GlyThr: 3.259 ± 0.578
4.512GlyVal: 4.512 ± 1.115
0.501GlyTrp: 0.501 ± 0.303
4.011GlyTyr: 4.011 ± 1.325
0.0GlyXaa: 0.0 ± 0.0
His
1.504HisAla: 1.504 ± 0.97
0.0HisCys: 0.0 ± 0.0
1.003HisAsp: 1.003 ± 0.692
0.752HisGlu: 0.752 ± 0.346
0.251HisPhe: 0.251 ± 0.233
1.003HisGly: 1.003 ± 0.414
1.003HisHis: 1.003 ± 0.512
1.253HisIle: 1.253 ± 0.49
2.006HisLys: 2.006 ± 0.747
2.256HisLeu: 2.256 ± 0.651
0.0HisMet: 0.0 ± 0.0
2.006HisAsn: 2.006 ± 0.689
1.253HisPro: 1.253 ± 0.453
0.501HisGln: 0.501 ± 0.312
0.501HisArg: 0.501 ± 0.369
0.501HisSer: 0.501 ± 0.431
1.253HisThr: 1.253 ± 0.41
0.251HisVal: 0.251 ± 0.199
0.752HisTrp: 0.752 ± 0.478
2.006HisTyr: 2.006 ± 0.72
0.0HisXaa: 0.0 ± 0.0
Ile
5.766IleAla: 5.766 ± 1.165
0.251IleCys: 0.251 ± 0.225
6.017IleAsp: 6.017 ± 1.218
5.766IleGlu: 5.766 ± 1.006
3.008IlePhe: 3.008 ± 0.758
2.006IleGly: 2.006 ± 0.636
0.501IleHis: 0.501 ± 0.466
5.515IleIle: 5.515 ± 1.468
7.521IleLys: 7.521 ± 1.203
4.512IleLeu: 4.512 ± 0.799
1.755IleMet: 1.755 ± 0.61
4.262IleAsn: 4.262 ± 0.855
2.758IlePro: 2.758 ± 0.707
2.758IleGln: 2.758 ± 0.873
3.259IleArg: 3.259 ± 0.82
5.264IleSer: 5.264 ± 1.622
5.515IleThr: 5.515 ± 1.118
3.259IleVal: 3.259 ± 0.776
0.501IleTrp: 0.501 ± 0.289
2.256IleTyr: 2.256 ± 0.838
0.0IleXaa: 0.0 ± 0.0
Lys
5.766LysAla: 5.766 ± 1.098
0.251LysCys: 0.251 ± 0.251
4.512LysAsp: 4.512 ± 0.975
10.78LysGlu: 10.78 ± 1.387
1.755LysPhe: 1.755 ± 0.562
4.512LysGly: 4.512 ± 1.282
3.259LysHis: 3.259 ± 0.78
5.264LysIle: 5.264 ± 1.557
7.771LysLys: 7.771 ± 1.816
7.521LysLeu: 7.521 ± 1.381
1.755LysMet: 1.755 ± 0.647
4.011LysAsn: 4.011 ± 0.798
5.264LysPro: 5.264 ± 1.243
4.262LysGln: 4.262 ± 1.217
5.014LysArg: 5.014 ± 1.142
3.008LysSer: 3.008 ± 0.976
5.264LysThr: 5.264 ± 1.275
4.763LysVal: 4.763 ± 0.836
0.752LysTrp: 0.752 ± 0.398
1.755LysTyr: 1.755 ± 0.655
0.0LysXaa: 0.0 ± 0.0
Leu
6.267LeuAla: 6.267 ± 1.858
0.501LeuCys: 0.501 ± 0.362
5.264LeuAsp: 5.264 ± 0.999
9.777LeuGlu: 9.777 ± 1.309
4.011LeuPhe: 4.011 ± 0.89
8.022LeuGly: 8.022 ± 1.362
1.504LeuHis: 1.504 ± 0.54
6.769LeuIle: 6.769 ± 1.403
8.774LeuLys: 8.774 ± 1.077
10.529LeuLeu: 10.529 ± 1.389
2.507LeuMet: 2.507 ± 0.701
3.76LeuAsn: 3.76 ± 1.095
5.515LeuPro: 5.515 ± 1.156
2.758LeuGln: 2.758 ± 0.604
2.507LeuArg: 2.507 ± 0.566
8.523LeuSer: 8.523 ± 1.683
3.76LeuThr: 3.76 ± 0.799
5.014LeuVal: 5.014 ± 1.389
1.003LeuTrp: 1.003 ± 0.472
5.766LeuTyr: 5.766 ± 0.79
0.0LeuXaa: 0.0 ± 0.0
Met
2.256MetAla: 2.256 ± 0.672
0.0MetCys: 0.0 ± 0.0
1.003MetAsp: 1.003 ± 0.353
0.752MetGlu: 0.752 ± 0.407
0.501MetPhe: 0.501 ± 0.33
0.501MetGly: 0.501 ± 0.344
0.0MetHis: 0.0 ± 0.0
1.504MetIle: 1.504 ± 0.48
2.758MetLys: 2.758 ± 0.636
2.256MetLeu: 2.256 ± 0.68
0.251MetMet: 0.251 ± 0.229
2.507MetAsn: 2.507 ± 0.554
0.752MetPro: 0.752 ± 0.32
0.752MetGln: 0.752 ± 0.512
1.253MetArg: 1.253 ± 0.448
2.256MetSer: 2.256 ± 0.694
3.259MetThr: 3.259 ± 0.962
0.501MetVal: 0.501 ± 0.285
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.51AsnAla: 3.51 ± 1.075
0.0AsnCys: 0.0 ± 0.0
2.507AsnAsp: 2.507 ± 0.728
3.008AsnGlu: 3.008 ± 0.685
1.755AsnPhe: 1.755 ± 0.956
5.264AsnGly: 5.264 ± 1.128
1.504AsnHis: 1.504 ± 0.565
1.504AsnIle: 1.504 ± 0.544
5.014AsnLys: 5.014 ± 0.907
5.264AsnLeu: 5.264 ± 0.934
1.003AsnMet: 1.003 ± 0.46
3.51AsnAsn: 3.51 ± 0.949
2.006AsnPro: 2.006 ± 0.599
3.51AsnGln: 3.51 ± 1.191
4.512AsnArg: 4.512 ± 0.813
1.755AsnSer: 1.755 ± 0.573
2.006AsnThr: 2.006 ± 0.617
3.259AsnVal: 3.259 ± 0.769
0.501AsnTrp: 0.501 ± 0.35
3.008AsnTyr: 3.008 ± 0.735
0.0AsnXaa: 0.0 ± 0.0
Pro
1.504ProAla: 1.504 ± 0.477
0.251ProCys: 0.251 ± 0.257
2.256ProAsp: 2.256 ± 0.711
4.512ProGlu: 4.512 ± 0.892
1.755ProPhe: 1.755 ± 0.637
0.752ProGly: 0.752 ± 0.425
0.0ProHis: 0.0 ± 0.0
1.253ProIle: 1.253 ± 0.564
3.76ProLys: 3.76 ± 1.072
2.507ProLeu: 2.507 ± 0.879
0.251ProMet: 0.251 ± 0.236
2.256ProAsn: 2.256 ± 0.559
1.003ProPro: 1.003 ± 0.39
0.501ProGln: 0.501 ± 0.331
2.507ProArg: 2.507 ± 0.757
2.256ProSer: 2.256 ± 0.715
3.008ProThr: 3.008 ± 0.632
2.006ProVal: 2.006 ± 0.542
0.251ProTrp: 0.251 ± 0.203
1.253ProTyr: 1.253 ± 0.489
0.0ProXaa: 0.0 ± 0.0
Gln
2.006GlnAla: 2.006 ± 0.58
0.0GlnCys: 0.0 ± 0.0
2.758GlnAsp: 2.758 ± 0.984
4.763GlnGlu: 4.763 ± 1.083
1.755GlnPhe: 1.755 ± 0.799
1.755GlnGly: 1.755 ± 0.526
1.504GlnHis: 1.504 ± 0.64
2.256GlnIle: 2.256 ± 0.593
5.014GlnLys: 5.014 ± 1.043
5.766GlnLeu: 5.766 ± 1.107
1.003GlnMet: 1.003 ± 0.751
1.755GlnAsn: 1.755 ± 0.783
1.504GlnPro: 1.504 ± 0.612
1.755GlnGln: 1.755 ± 0.526
2.507GlnArg: 2.507 ± 0.726
2.507GlnSer: 2.507 ± 0.681
2.256GlnThr: 2.256 ± 0.805
2.758GlnVal: 2.758 ± 1.106
0.0GlnTrp: 0.0 ± 0.0
1.253GlnTyr: 1.253 ± 0.477
0.0GlnXaa: 0.0 ± 0.0
Arg
2.006ArgAla: 2.006 ± 0.613
0.501ArgCys: 0.501 ± 0.322
1.755ArgAsp: 1.755 ± 0.597
3.259ArgGlu: 3.259 ± 1.308
2.507ArgPhe: 2.507 ± 0.73
2.256ArgGly: 2.256 ± 0.829
1.253ArgHis: 1.253 ± 0.608
2.758ArgIle: 2.758 ± 0.743
4.763ArgLys: 4.763 ± 0.957
6.769ArgLeu: 6.769 ± 1.036
1.253ArgMet: 1.253 ± 0.585
2.758ArgAsn: 2.758 ± 0.728
1.003ArgPro: 1.003 ± 0.428
3.76ArgGln: 3.76 ± 0.959
2.006ArgArg: 2.006 ± 0.692
2.256ArgSer: 2.256 ± 0.854
2.758ArgThr: 2.758 ± 0.653
3.51ArgVal: 3.51 ± 0.942
0.501ArgTrp: 0.501 ± 0.393
3.51ArgTyr: 3.51 ± 0.895
0.0ArgXaa: 0.0 ± 0.0
Ser
3.008SerAla: 3.008 ± 0.679
0.251SerCys: 0.251 ± 0.236
3.76SerAsp: 3.76 ± 0.717
3.51SerGlu: 3.51 ± 0.823
3.76SerPhe: 3.76 ± 1.165
2.006SerGly: 2.006 ± 0.725
0.752SerHis: 0.752 ± 0.514
5.014SerIle: 5.014 ± 0.897
4.262SerLys: 4.262 ± 0.98
7.771SerLeu: 7.771 ± 1.474
1.253SerMet: 1.253 ± 0.431
1.253SerAsn: 1.253 ± 0.494
1.504SerPro: 1.504 ± 0.648
2.006SerGln: 2.006 ± 0.788
3.008SerArg: 3.008 ± 0.974
2.256SerSer: 2.256 ± 0.748
2.758SerThr: 2.758 ± 0.893
4.011SerVal: 4.011 ± 1.033
1.003SerTrp: 1.003 ± 0.541
3.259SerTyr: 3.259 ± 0.71
0.0SerXaa: 0.0 ± 0.0
Thr
3.259ThrAla: 3.259 ± 0.937
0.0ThrCys: 0.0 ± 0.0
1.504ThrAsp: 1.504 ± 0.55
3.51ThrGlu: 3.51 ± 1.114
3.259ThrPhe: 3.259 ± 1.378
4.763ThrGly: 4.763 ± 0.902
1.003ThrHis: 1.003 ± 0.433
4.262ThrIle: 4.262 ± 0.978
2.256ThrLys: 2.256 ± 0.648
5.264ThrLeu: 5.264 ± 0.94
1.755ThrMet: 1.755 ± 0.621
1.504ThrAsn: 1.504 ± 0.58
3.76ThrPro: 3.76 ± 0.756
3.008ThrGln: 3.008 ± 0.755
3.259ThrArg: 3.259 ± 0.847
3.76ThrSer: 3.76 ± 0.935
3.259ThrThr: 3.259 ± 0.922
2.256ThrVal: 2.256 ± 0.685
0.501ThrTrp: 0.501 ± 0.281
2.758ThrTyr: 2.758 ± 0.869
0.0ThrXaa: 0.0 ± 0.0
Val
3.259ValAla: 3.259 ± 0.606
0.0ValCys: 0.0 ± 0.0
1.755ValAsp: 1.755 ± 0.566
2.507ValGlu: 2.507 ± 0.889
3.259ValPhe: 3.259 ± 0.981
3.008ValGly: 3.008 ± 0.661
0.752ValHis: 0.752 ± 0.464
5.264ValIle: 5.264 ± 0.893
3.51ValLys: 3.51 ± 0.706
5.766ValLeu: 5.766 ± 1.251
1.003ValMet: 1.003 ± 0.389
3.259ValAsn: 3.259 ± 1.043
1.253ValPro: 1.253 ± 0.536
2.256ValGln: 2.256 ± 0.69
1.504ValArg: 1.504 ± 0.577
4.011ValSer: 4.011 ± 0.87
3.259ValThr: 3.259 ± 0.97
3.51ValVal: 3.51 ± 0.943
0.752ValTrp: 0.752 ± 0.455
1.253ValTyr: 1.253 ± 0.388
0.0ValXaa: 0.0 ± 0.0
Trp
0.251TrpAla: 0.251 ± 0.203
0.0TrpCys: 0.0 ± 0.0
0.752TrpAsp: 0.752 ± 0.406
1.003TrpGlu: 1.003 ± 0.511
0.501TrpPhe: 0.501 ± 0.406
0.251TrpGly: 0.251 ± 0.235
0.0TrpHis: 0.0 ± 0.0
0.501TrpIle: 0.501 ± 0.399
0.752TrpLys: 0.752 ± 0.425
2.006TrpLeu: 2.006 ± 0.634
0.0TrpMet: 0.0 ± 0.0
0.251TrpAsn: 0.251 ± 0.203
0.251TrpPro: 0.251 ± 0.199
0.251TrpGln: 0.251 ± 0.296
0.752TrpArg: 0.752 ± 0.423
0.752TrpSer: 0.752 ± 0.356
0.0TrpThr: 0.0 ± 0.0
1.003TrpVal: 1.003 ± 0.494
0.0TrpTrp: 0.0 ± 0.0
1.003TrpTyr: 1.003 ± 0.619
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.507TyrAla: 2.507 ± 1.243
0.251TyrCys: 0.251 ± 0.243
2.256TyrAsp: 2.256 ± 0.573
2.758TyrGlu: 2.758 ± 0.85
3.76TyrPhe: 3.76 ± 0.706
2.758TyrGly: 2.758 ± 0.73
1.253TyrHis: 1.253 ± 0.467
2.507TyrIle: 2.507 ± 0.679
4.512TyrLys: 4.512 ± 0.807
3.76TyrLeu: 3.76 ± 0.709
1.504TyrMet: 1.504 ± 0.67
4.262TyrAsn: 4.262 ± 1.071
1.003TyrPro: 1.003 ± 0.715
3.008TyrGln: 3.008 ± 0.961
3.259TyrArg: 3.259 ± 1.163
2.006TyrSer: 2.006 ± 0.768
1.504TyrThr: 1.504 ± 0.479
1.504TyrVal: 1.504 ± 0.5
0.501TyrTrp: 0.501 ± 0.298
3.008TyrTyr: 3.008 ± 0.893
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 25 proteins (3990 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski