Amino acid dipeptide frequency for Streptococcus phage Javan100

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 standard dipeptides were equally likely in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
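
The calculation described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used to build this page: the function name `dipeptide_permille`, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every non-standard or unknown residue is mapped to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Hypothetical toy input, not taken from the Javan100 proteome.
toy = ["MKLLA", "AKLX"]
freqs = dipeptide_permille(toy)

# Uniform expectation over the 400 standard dipeptides, in per mille.
uniform = 1000.0 / 400

for dp in sorted(freqs):
    print(dp, round(freqs[dp], 3), "uniform:", uniform)
```

With a real proteome, `proteins` would be the list of protein sequences read from a FASTA file; dipeptides that never occur are simply absent from the returned dictionary and correspond to 0.0 in the table.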

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
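
As a small worked example of the unit conversion, the sketch below converts two values from the table (LeuLys and CysCys) from per mille to fractions and compares them with the uniform expectation of 2.5‰. The helper names and the simple greater/less rule are assumptions for illustration; the exact rule the page uses for its red and blue marking is not stated here.

```python
# Uniform expectation for 400 equally likely dipeptides: 2.5 per mille (0.25 %).
UNIFORM_PERMILLE = 1000.0 / 400


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, baseline=UNIFORM_PERMILLE):
    """Assumed rule: compare a per mille value with the uniform baseline."""
    if value > baseline:
        return "over"
    if value < baseline:
        return "under"
    return "expected"


# Values copied from the table (in per mille).
leu_lys = 9.052
cys_cys = 0.097

for name, value in (("LeuLys", leu_lys), ("CysCys", cys_cys)):
    print(name, permille_to_fraction(value), classify(value))
```

This comparison ignores the reported error; a stricter rule could require the difference from the baseline to exceed the bootstrap error.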

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.769 ± 1.085
AlaCys: 0.779 ± 0.292
AlaAsp: 3.699 ± 0.61
AlaGlu: 5.256 ± 0.882
AlaPhe: 2.725 ± 0.465
AlaGly: 3.212 ± 0.559
AlaHis: 0.584 ± 0.229
AlaIle: 5.548 ± 1.0
AlaLys: 5.159 ± 0.723
AlaLeu: 7.787 ± 1.323
AlaMet: 1.557 ± 0.436
AlaAsn: 3.309 ± 0.612
AlaPro: 1.363 ± 0.409
AlaGln: 2.239 ± 0.5
AlaArg: 2.823 ± 0.466
AlaSer: 4.672 ± 0.844
AlaThr: 4.38 ± 0.582
AlaVal: 4.477 ± 0.723
AlaTrp: 0.779 ± 0.285
AlaTyr: 3.309 ± 0.475
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.292 ± 0.149
CysCys: 0.097 ± 0.092
CysAsp: 0.292 ± 0.148
CysGlu: 0.779 ± 0.232
CysPhe: 0.097 ± 0.083
CysGly: 0.584 ± 0.191
CysHis: 0.195 ± 0.123
CysIle: 0.487 ± 0.208
CysLys: 0.584 ± 0.35
CysLeu: 0.487 ± 0.243
CysMet: 0.097 ± 0.084
CysAsn: 0.389 ± 0.191
CysPro: 0.097 ± 0.104
CysGln: 0.487 ± 0.198
CysArg: 0.487 ± 0.217
CysSer: 0.487 ± 0.209
CysThr: 0.292 ± 0.162
CysVal: 0.487 ± 0.216
CysTrp: 0.097 ± 0.094
CysTyr: 0.779 ± 0.251
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.115 ± 0.411
AspCys: 0.584 ± 0.254
AspAsp: 3.212 ± 0.741
AspGlu: 4.185 ± 0.711
AspPhe: 3.212 ± 0.361
AspGly: 4.769 ± 0.524
AspHis: 0.389 ± 0.196
AspIle: 3.017 ± 0.323
AspLys: 3.796 ± 0.479
AspLeu: 4.672 ± 0.647
AspMet: 2.141 ± 0.524
AspAsn: 1.947 ± 0.291
AspPro: 1.557 ± 0.404
AspGln: 1.557 ± 0.301
AspArg: 2.044 ± 0.483
AspSer: 4.38 ± 0.583
AspThr: 3.017 ± 0.578
AspVal: 3.115 ± 0.581
AspTrp: 1.168 ± 0.264
AspTyr: 1.947 ± 0.465
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.327 ± 0.854
GluCys: 0.389 ± 0.174
GluAsp: 4.38 ± 0.726
GluGlu: 8.565 ± 1.289
GluPhe: 2.531 ± 0.549
GluGly: 5.743 ± 0.714
GluHis: 1.752 ± 0.406
GluIle: 4.088 ± 0.535
GluLys: 6.521 ± 0.714
GluLeu: 8.857 ± 0.851
GluMet: 1.752 ± 0.367
GluAsn: 4.185 ± 0.749
GluPro: 2.239 ± 0.531
GluGln: 4.38 ± 0.858
GluArg: 3.699 ± 0.753
GluSer: 3.796 ± 0.494
GluThr: 4.964 ± 0.742
GluVal: 5.061 ± 0.606
GluTrp: 1.071 ± 0.281
GluTyr: 1.849 ± 0.453
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.849 ± 0.505
PheCys: 0.389 ± 0.191
PheAsp: 2.92 ± 0.464
PheGlu: 2.92 ± 0.632
PhePhe: 1.363 ± 0.423
PheGly: 3.017 ± 0.382
PheHis: 0.487 ± 0.21
PheIle: 2.433 ± 0.437
PheLys: 2.628 ± 0.559
PheLeu: 3.407 ± 0.75
PheMet: 0.779 ± 0.303
PheAsn: 1.265 ± 0.259
PhePro: 0.681 ± 0.264
PheGln: 2.239 ± 0.498
PheArg: 1.363 ± 0.333
PheSer: 2.92 ± 0.473
PheThr: 2.044 ± 0.525
PheVal: 2.433 ± 0.391
PheTrp: 0.681 ± 0.248
PheTyr: 1.849 ± 0.401
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.601 ± 0.774
GlyCys: 0.487 ± 0.171
GlyAsp: 3.991 ± 0.55
GlyGlu: 4.185 ± 0.704
GlyPhe: 2.628 ± 0.507
GlyGly: 4.672 ± 0.584
GlyHis: 1.071 ± 0.25
GlyIle: 5.645 ± 1.025
GlyLys: 5.743 ± 0.817
GlyLeu: 5.84 ± 0.927
GlyMet: 1.849 ± 0.466
GlyAsn: 3.115 ± 0.614
GlyPro: 0.681 ± 0.219
GlyGln: 2.239 ± 0.455
GlyArg: 4.477 ± 0.462
GlySer: 3.699 ± 0.39
GlyThr: 3.991 ± 0.777
GlyVal: 4.088 ± 0.714
GlyTrp: 1.168 ± 0.268
GlyTyr: 1.752 ± 0.34
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.071 ± 0.287
HisCys: 0.097 ± 0.094
HisAsp: 0.487 ± 0.227
HisGlu: 1.071 ± 0.278
HisPhe: 0.779 ± 0.214
HisGly: 0.876 ± 0.272
HisHis: 0.584 ± 0.207
HisIle: 1.46 ± 0.359
HisLys: 0.973 ± 0.323
HisLeu: 2.336 ± 0.419
HisMet: 0.292 ± 0.184
HisAsn: 0.681 ± 0.23
HisPro: 1.071 ± 0.321
HisGln: 0.779 ± 0.229
HisArg: 1.363 ± 0.354
HisSer: 1.265 ± 0.452
HisThr: 0.876 ± 0.397
HisVal: 1.265 ± 0.336
HisTrp: 0.097 ± 0.096
HisTyr: 0.681 ± 0.323
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.991 ± 0.587
IleCys: 0.487 ± 0.254
IleAsp: 4.38 ± 0.513
IleGlu: 3.991 ± 0.526
IlePhe: 1.655 ± 0.415
IleGly: 3.991 ± 0.621
IleHis: 0.779 ± 0.213
IleIle: 2.628 ± 0.603
IleLys: 4.185 ± 0.782
IleLeu: 6.327 ± 0.766
IleMet: 0.973 ± 0.33
IleAsn: 2.336 ± 0.437
IlePro: 2.044 ± 0.461
IleGln: 2.92 ± 0.539
IleArg: 3.407 ± 0.488
IleSer: 4.672 ± 0.669
IleThr: 2.531 ± 0.566
IleVal: 5.353 ± 0.705
IleTrp: 1.363 ± 0.39
IleTyr: 2.336 ± 0.365
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.84 ± 0.693
LysCys: 0.195 ± 0.13
LysAsp: 3.893 ± 0.665
LysGlu: 8.079 ± 0.993
LysPhe: 2.92 ± 0.564
LysGly: 5.159 ± 0.74
LysHis: 1.363 ± 0.356
LysIle: 4.477 ± 0.648
LysLys: 5.84 ± 0.985
LysLeu: 6.229 ± 0.742
LysMet: 1.557 ± 0.392
LysAsn: 3.309 ± 0.655
LysPro: 3.017 ± 0.532
LysGln: 3.991 ± 0.614
LysArg: 4.185 ± 0.714
LysSer: 4.185 ± 0.482
LysThr: 3.991 ± 0.558
LysVal: 4.575 ± 0.66
LysTrp: 0.973 ± 0.238
LysTyr: 1.849 ± 0.488
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.273 ± 1.264
LeuCys: 0.584 ± 0.176
LeuAsp: 5.256 ± 0.648
LeuGlu: 8.663 ± 1.152
LeuPhe: 3.504 ± 0.529
LeuGly: 5.353 ± 0.58
LeuHis: 1.752 ± 0.42
LeuIle: 4.867 ± 0.666
LeuLys: 9.052 ± 0.753
LeuLeu: 8.176 ± 0.699
LeuMet: 2.336 ± 0.401
LeuAsn: 4.185 ± 0.711
LeuPro: 3.309 ± 0.556
LeuGln: 4.088 ± 0.557
LeuArg: 4.38 ± 0.625
LeuSer: 7.689 ± 0.715
LeuThr: 6.424 ± 0.936
LeuVal: 4.867 ± 0.8
LeuTrp: 0.681 ± 0.246
LeuTyr: 4.672 ± 0.725
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.363 ± 0.387
MetCys: 0.195 ± 0.149
MetAsp: 1.655 ± 0.373
MetGlu: 2.433 ± 0.497
MetPhe: 0.389 ± 0.188
MetGly: 1.752 ± 0.363
MetHis: 0.097 ± 0.103
MetIle: 1.168 ± 0.389
MetLys: 2.044 ± 0.438
MetLeu: 1.655 ± 0.44
MetMet: 0.681 ± 0.351
MetAsn: 0.487 ± 0.197
MetPro: 0.292 ± 0.184
MetGln: 0.584 ± 0.208
MetArg: 1.168 ± 0.35
MetSer: 1.849 ± 0.451
MetThr: 1.947 ± 0.427
MetVal: 2.336 ± 0.561
MetTrp: 0.097 ± 0.09
MetTyr: 0.487 ± 0.266
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.601 ± 0.731
AsnCys: 0.097 ± 0.102
AsnAsp: 1.071 ± 0.332
AsnGlu: 2.725 ± 0.449
AsnPhe: 1.752 ± 0.337
AsnGly: 3.893 ± 0.61
AsnHis: 1.071 ± 0.318
AsnIle: 2.725 ± 0.478
AsnLys: 3.017 ± 0.454
AsnLeu: 4.283 ± 0.74
AsnMet: 1.265 ± 0.326
AsnAsn: 1.46 ± 0.342
AsnPro: 2.628 ± 0.506
AsnGln: 2.433 ± 0.527
AsnArg: 2.92 ± 0.516
AsnSer: 2.141 ± 0.383
AsnThr: 1.655 ± 0.363
AsnVal: 2.433 ± 0.552
AsnTrp: 0.681 ± 0.333
AsnTyr: 1.363 ± 0.318
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.557 ± 0.34
ProCys: 0.195 ± 0.127
ProAsp: 1.947 ± 0.483
ProGlu: 2.336 ± 0.425
ProPhe: 1.071 ± 0.385
ProGly: 2.239 ± 0.584
ProHis: 0.779 ± 0.273
ProIle: 1.752 ± 0.399
ProLys: 1.557 ± 0.418
ProLeu: 3.309 ± 0.428
ProMet: 0.292 ± 0.191
ProAsn: 0.876 ± 0.332
ProPro: 0.973 ± 0.386
ProGln: 1.655 ± 0.417
ProArg: 1.46 ± 0.33
ProSer: 2.239 ± 0.537
ProThr: 1.849 ± 0.411
ProVal: 2.433 ± 0.582
ProTrp: 0.487 ± 0.224
ProTyr: 1.168 ± 0.318
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.283 ± 0.582
GlnCys: 0.389 ± 0.24
GlnAsp: 1.947 ± 0.461
GlnGlu: 2.92 ± 0.5
GlnPhe: 2.823 ± 0.528
GlnGly: 2.725 ± 0.487
GlnHis: 0.876 ± 0.266
GlnIle: 2.239 ± 0.707
GlnLys: 4.088 ± 0.656
GlnLeu: 5.061 ± 0.725
GlnMet: 0.779 ± 0.232
GlnAsn: 2.92 ± 0.446
GlnPro: 1.071 ± 0.341
GlnGln: 1.849 ± 0.478
GlnArg: 1.46 ± 0.391
GlnSer: 2.141 ± 0.542
GlnThr: 3.017 ± 0.717
GlnVal: 4.575 ± 0.728
GlnTrp: 0.584 ± 0.25
GlnTyr: 0.779 ± 0.208
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.92 ± 0.426
ArgCys: 0.487 ± 0.23
ArgAsp: 1.947 ± 0.371
ArgGlu: 3.796 ± 0.555
ArgPhe: 1.947 ± 0.514
ArgGly: 2.433 ± 0.513
ArgHis: 0.681 ± 0.255
ArgIle: 2.531 ± 0.409
ArgLys: 4.38 ± 0.827
ArgLeu: 5.743 ± 0.702
ArgMet: 1.071 ± 0.273
ArgAsn: 2.044 ± 0.347
ArgPro: 1.46 ± 0.428
ArgGln: 2.628 ± 0.494
ArgArg: 2.725 ± 0.54
ArgSer: 2.628 ± 0.577
ArgThr: 2.239 ± 0.487
ArgVal: 3.991 ± 0.714
ArgTrp: 0.681 ± 0.259
ArgTyr: 2.044 ± 0.349
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.796 ± 0.752
SerCys: 0.292 ± 0.208
SerAsp: 2.823 ± 0.483
SerGlu: 5.548 ± 0.801
SerPhe: 2.92 ± 0.666
SerGly: 4.769 ± 0.603
SerHis: 1.752 ± 0.374
SerIle: 4.283 ± 0.532
SerLys: 4.283 ± 0.723
SerLeu: 5.743 ± 0.554
SerMet: 1.46 ± 0.434
SerAsn: 2.336 ± 0.593
SerPro: 2.044 ± 0.416
SerGln: 3.017 ± 0.52
SerArg: 3.601 ± 0.594
SerSer: 4.867 ± 0.684
SerThr: 4.283 ± 0.723
SerVal: 3.115 ± 0.639
SerTrp: 1.46 ± 0.388
SerTyr: 2.725 ± 0.5
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.38 ± 0.607
ThrCys: 0.389 ± 0.171
ThrAsp: 2.141 ± 0.407
ThrGlu: 4.38 ± 0.611
ThrPhe: 1.46 ± 0.298
ThrGly: 2.92 ± 0.544
ThrHis: 0.779 ± 0.27
ThrIle: 4.769 ± 0.755
ThrLys: 4.964 ± 0.683
ThrLeu: 7.105 ± 1.07
ThrMet: 0.876 ± 0.25
ThrAsn: 2.531 ± 0.418
ThrPro: 1.071 ± 0.388
ThrGln: 2.336 ± 0.459
ThrArg: 2.044 ± 0.503
ThrSer: 3.991 ± 0.505
ThrThr: 3.699 ± 0.839
ThrVal: 5.937 ± 0.77
ThrTrp: 0.876 ± 0.255
ThrTyr: 1.655 ± 0.375
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.991 ± 0.628
ValCys: 0.681 ± 0.343
ValAsp: 4.769 ± 0.697
ValGlu: 5.256 ± 0.734
ValPhe: 1.849 ± 0.413
ValGly: 3.504 ± 0.585
ValHis: 1.363 ± 0.341
ValIle: 4.185 ± 0.678
ValLys: 3.407 ± 0.618
ValLeu: 7.008 ± 0.674
ValMet: 1.947 ± 0.515
ValAsn: 2.725 ± 0.59
ValPro: 2.823 ± 0.569
ValGln: 3.796 ± 0.635
ValArg: 3.212 ± 0.521
ValSer: 4.185 ± 0.693
ValThr: 4.769 ± 0.801
ValVal: 3.601 ± 0.503
ValTrp: 1.46 ± 0.309
ValTyr: 2.823 ± 0.587
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.168 ± 0.344
TrpCys: 0.292 ± 0.147
TrpAsp: 0.779 ± 0.234
TrpGlu: 1.947 ± 0.379
TrpPhe: 0.973 ± 0.331
TrpGly: 0.681 ± 0.213
TrpHis: 0.195 ± 0.118
TrpIle: 0.487 ± 0.236
TrpLys: 0.876 ± 0.32
TrpLeu: 1.363 ± 0.259
TrpMet: 0.195 ± 0.133
TrpAsn: 1.752 ± 0.43
TrpPro: 0.389 ± 0.246
TrpGln: 1.071 ± 0.253
TrpArg: 0.389 ± 0.215
TrpSer: 0.876 ± 0.253
TrpThr: 0.487 ± 0.208
TrpVal: 0.973 ± 0.308
TrpTrp: 0.292 ± 0.137
TrpTyr: 0.292 ± 0.137
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.239 ± 0.463
TyrCys: 0.487 ± 0.244
TyrAsp: 2.239 ± 0.645
TyrGlu: 3.115 ± 0.427
TyrPhe: 1.071 ± 0.381
TyrGly: 2.628 ± 0.52
TyrHis: 1.557 ± 0.354
TyrIle: 1.557 ± 0.425
TyrLys: 2.531 ± 0.537
TyrLeu: 2.823 ± 0.474
TyrMet: 0.681 ± 0.316
TyrAsn: 1.363 ± 0.412
TyrPro: 1.363 ± 0.341
TyrGln: 2.239 ± 0.488
TyrArg: 1.168 ± 0.258
TyrSer: 2.433 ± 0.593
TyrThr: 1.849 ± 0.532
TyrVal: 2.239 ± 0.609
TyrTrp: 0.779 ± 0.28
TyrTyr: 1.655 ± 0.432
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 37 proteins (10275 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
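
A protein-level bootstrap of this kind can be sketched as follows. This is a hedged illustration rather than the database's actual procedure: it assumes that whole proteins are resampled with replacement, that the dipeptide frequency is recomputed for each of the 100 replicates, and that the reported error is the standard deviation across replicates. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Assumed error estimate: std. dev. over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the 37 Javan100 proteins.
toy = ["MKLLAKL", "AKLGGS", "MSTKLLE", "GGAKV"]
print(dipeptide_permille(toy, "KL"), "±", bootstrap_error(toy, "KL"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins in the proteome.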

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski