Amino acid dipepetide frequency for Streptococcus phage Javan240

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.937AlaAla: 3.937 ± 0.869
0.505AlaCys: 0.505 ± 0.201
5.148AlaAsp: 5.148 ± 0.722
5.249AlaGlu: 5.249 ± 0.712
3.129AlaPhe: 3.129 ± 0.401
5.047AlaGly: 5.047 ± 1.186
0.606AlaHis: 0.606 ± 0.206
6.259AlaIle: 6.259 ± 1.205
4.745AlaLys: 4.745 ± 0.51
4.946AlaLeu: 4.946 ± 0.684
1.009AlaMet: 1.009 ± 0.352
3.836AlaAsn: 3.836 ± 0.805
1.11AlaPro: 1.11 ± 0.381
3.028AlaGln: 3.028 ± 0.42
3.028AlaArg: 3.028 ± 0.468
4.24AlaSer: 4.24 ± 0.704
3.836AlaThr: 3.836 ± 0.562
4.946AlaVal: 4.946 ± 0.919
1.413AlaTrp: 1.413 ± 0.691
2.12AlaTyr: 2.12 ± 0.417
0.0AlaXaa: 0.0 ± 0.0
Cys
0.404CysAla: 0.404 ± 0.194
0.101CysCys: 0.101 ± 0.096
0.808CysAsp: 0.808 ± 0.23
0.404CysGlu: 0.404 ± 0.225
0.101CysPhe: 0.101 ± 0.091
0.606CysGly: 0.606 ± 0.274
0.202CysHis: 0.202 ± 0.149
0.303CysIle: 0.303 ± 0.196
0.303CysLys: 0.303 ± 0.192
0.808CysLeu: 0.808 ± 0.236
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.101CysPro: 0.101 ± 0.091
0.505CysGln: 0.505 ± 0.222
0.404CysArg: 0.404 ± 0.21
0.606CysSer: 0.606 ± 0.208
0.303CysThr: 0.303 ± 0.163
0.505CysVal: 0.505 ± 0.185
0.202CysTrp: 0.202 ± 0.15
0.303CysTyr: 0.303 ± 0.148
0.0CysXaa: 0.0 ± 0.0
Asp
3.937AspAla: 3.937 ± 0.516
0.606AspCys: 0.606 ± 0.293
2.928AspAsp: 2.928 ± 0.809
4.442AspGlu: 4.442 ± 0.645
3.634AspPhe: 3.634 ± 0.565
5.653AspGly: 5.653 ± 0.768
0.606AspHis: 0.606 ± 0.284
5.35AspIle: 5.35 ± 0.6
5.855AspLys: 5.855 ± 0.91
5.451AspLeu: 5.451 ± 0.781
1.312AspMet: 1.312 ± 0.432
3.23AspAsn: 3.23 ± 0.432
1.211AspPro: 1.211 ± 0.612
0.808AspGln: 0.808 ± 0.289
1.817AspArg: 1.817 ± 0.348
3.937AspSer: 3.937 ± 0.685
2.221AspThr: 2.221 ± 0.441
3.432AspVal: 3.432 ± 0.572
1.009AspTrp: 1.009 ± 0.292
2.928AspTyr: 2.928 ± 0.806
0.0AspXaa: 0.0 ± 0.0
Glu
5.148GluAla: 5.148 ± 0.702
0.404GluCys: 0.404 ± 0.223
4.24GluAsp: 4.24 ± 0.704
5.35GluGlu: 5.35 ± 0.956
3.028GluPhe: 3.028 ± 0.574
3.634GluGly: 3.634 ± 0.652
1.009GluHis: 1.009 ± 0.33
6.461GluIle: 6.461 ± 0.977
5.451GluLys: 5.451 ± 0.985
6.36GluLeu: 6.36 ± 0.967
2.928GluMet: 2.928 ± 0.614
4.846GluAsn: 4.846 ± 0.699
2.221GluPro: 2.221 ± 0.477
3.634GluGln: 3.634 ± 0.723
3.432GluArg: 3.432 ± 0.604
4.341GluSer: 4.341 ± 0.626
4.038GluThr: 4.038 ± 0.541
5.249GluVal: 5.249 ± 1.051
0.808GluTrp: 0.808 ± 0.289
2.423GluTyr: 2.423 ± 0.612
0.0GluXaa: 0.0 ± 0.0
Phe
2.524PheAla: 2.524 ± 0.475
0.404PheCys: 0.404 ± 0.185
2.827PheAsp: 2.827 ± 0.471
4.543PheGlu: 4.543 ± 0.653
2.625PhePhe: 2.625 ± 0.73
3.23PheGly: 3.23 ± 0.528
0.606PheHis: 0.606 ± 0.302
2.726PheIle: 2.726 ± 0.516
4.24PheLys: 4.24 ± 0.622
2.524PheLeu: 2.524 ± 0.538
1.312PheMet: 1.312 ± 0.346
2.12PheAsn: 2.12 ± 0.365
1.211PhePro: 1.211 ± 0.356
0.909PheGln: 0.909 ± 0.274
2.12PheArg: 2.12 ± 0.521
3.533PheSer: 3.533 ± 0.795
2.322PheThr: 2.322 ± 0.461
2.928PheVal: 2.928 ± 0.535
0.404PheTrp: 0.404 ± 0.126
1.312PheTyr: 1.312 ± 0.398
0.0PheXaa: 0.0 ± 0.0
Gly
4.442GlyAla: 4.442 ± 1.049
0.101GlyCys: 0.101 ± 0.096
3.432GlyAsp: 3.432 ± 0.681
4.846GlyGlu: 4.846 ± 0.873
4.139GlyPhe: 4.139 ± 1.141
5.35GlyGly: 5.35 ± 0.888
1.514GlyHis: 1.514 ± 0.448
8.177GlyIle: 8.177 ± 2.514
4.442GlyLys: 4.442 ± 0.558
6.461GlyLeu: 6.461 ± 1.351
1.615GlyMet: 1.615 ± 0.365
3.735GlyAsn: 3.735 ± 0.705
0.808GlyPro: 0.808 ± 0.301
3.533GlyGln: 3.533 ± 0.806
3.331GlyArg: 3.331 ± 0.547
3.129GlySer: 3.129 ± 0.58
3.533GlyThr: 3.533 ± 0.643
3.735GlyVal: 3.735 ± 0.732
0.707GlyTrp: 0.707 ± 0.208
3.836GlyTyr: 3.836 ± 0.617
0.0GlyXaa: 0.0 ± 0.0
His
0.606HisAla: 0.606 ± 0.22
0.404HisCys: 0.404 ± 0.189
1.211HisAsp: 1.211 ± 0.439
1.211HisGlu: 1.211 ± 0.387
0.909HisPhe: 0.909 ± 0.307
1.211HisGly: 1.211 ± 0.362
0.101HisHis: 0.101 ± 0.11
1.009HisIle: 1.009 ± 0.34
0.707HisLys: 0.707 ± 0.227
0.909HisLeu: 0.909 ± 0.373
0.101HisMet: 0.101 ± 0.102
0.707HisAsn: 0.707 ± 0.224
0.505HisPro: 0.505 ± 0.241
1.009HisGln: 1.009 ± 0.343
0.404HisArg: 0.404 ± 0.209
1.211HisSer: 1.211 ± 0.274
0.505HisThr: 0.505 ± 0.207
1.009HisVal: 1.009 ± 0.403
0.202HisTrp: 0.202 ± 0.125
1.514HisTyr: 1.514 ± 0.364
0.0HisXaa: 0.0 ± 0.0
Ile
5.249IleAla: 5.249 ± 0.814
0.606IleCys: 0.606 ± 0.209
5.956IleAsp: 5.956 ± 0.859
5.653IleGlu: 5.653 ± 0.663
3.331IlePhe: 3.331 ± 0.611
4.442IleGly: 4.442 ± 0.774
0.909IleHis: 0.909 ± 0.237
4.442IleIle: 4.442 ± 0.69
7.369IleLys: 7.369 ± 1.031
5.754IleLeu: 5.754 ± 0.898
1.11IleMet: 1.11 ± 0.548
4.24IleAsn: 4.24 ± 0.605
2.423IlePro: 2.423 ± 0.397
2.827IleGln: 2.827 ± 0.433
3.432IleArg: 3.432 ± 0.726
6.36IleSer: 6.36 ± 0.826
5.35IleThr: 5.35 ± 1.06
3.735IleVal: 3.735 ± 0.473
0.909IleTrp: 0.909 ± 0.321
1.918IleTyr: 1.918 ± 0.486
0.0IleXaa: 0.0 ± 0.0
Lys
6.158LysAla: 6.158 ± 0.895
0.404LysCys: 0.404 ± 0.195
4.946LysAsp: 4.946 ± 0.858
5.754LysGlu: 5.754 ± 0.807
2.423LysPhe: 2.423 ± 0.514
5.855LysGly: 5.855 ± 0.665
1.009LysHis: 1.009 ± 0.302
5.956LysIle: 5.956 ± 0.745
7.672LysLys: 7.672 ± 0.935
6.461LysLeu: 6.461 ± 0.868
2.423LysMet: 2.423 ± 0.462
5.35LysAsn: 5.35 ± 0.675
1.716LysPro: 1.716 ± 0.502
3.028LysGln: 3.028 ± 0.731
3.432LysArg: 3.432 ± 0.666
4.745LysSer: 4.745 ± 0.646
5.451LysThr: 5.451 ± 0.782
5.35LysVal: 5.35 ± 0.889
1.11LysTrp: 1.11 ± 0.351
4.139LysTyr: 4.139 ± 0.78
0.0LysXaa: 0.0 ± 0.0
Leu
6.158LeuAla: 6.158 ± 0.967
0.303LeuCys: 0.303 ± 0.145
5.047LeuAsp: 5.047 ± 0.692
5.552LeuGlu: 5.552 ± 0.653
4.24LeuPhe: 4.24 ± 0.454
5.552LeuGly: 5.552 ± 0.966
1.11LeuHis: 1.11 ± 0.254
4.946LeuIle: 4.946 ± 0.792
9.388LeuLys: 9.388 ± 0.958
6.259LeuLeu: 6.259 ± 0.844
1.312LeuMet: 1.312 ± 0.402
4.644LeuAsn: 4.644 ± 0.775
2.524LeuPro: 2.524 ± 0.582
3.028LeuGln: 3.028 ± 0.692
3.129LeuArg: 3.129 ± 0.752
5.552LeuSer: 5.552 ± 0.664
4.745LeuThr: 4.745 ± 0.654
3.735LeuVal: 3.735 ± 0.759
1.312LeuTrp: 1.312 ± 0.599
2.827LeuTyr: 2.827 ± 0.614
0.0LeuXaa: 0.0 ± 0.0
Met
1.413MetAla: 1.413 ± 0.381
0.202MetCys: 0.202 ± 0.126
0.808MetAsp: 0.808 ± 0.304
2.423MetGlu: 2.423 ± 0.383
0.202MetPhe: 0.202 ± 0.149
1.413MetGly: 1.413 ± 0.441
0.303MetHis: 0.303 ± 0.192
2.12MetIle: 2.12 ± 0.44
1.716MetLys: 1.716 ± 0.47
1.514MetLeu: 1.514 ± 0.385
0.202MetMet: 0.202 ± 0.162
1.312MetAsn: 1.312 ± 0.362
0.606MetPro: 0.606 ± 0.243
0.909MetGln: 0.909 ± 0.259
1.211MetArg: 1.211 ± 0.495
1.11MetSer: 1.11 ± 0.324
1.716MetThr: 1.716 ± 0.47
1.312MetVal: 1.312 ± 0.316
0.202MetTrp: 0.202 ± 0.181
0.404MetTyr: 0.404 ± 0.151
0.0MetXaa: 0.0 ± 0.0
Asn
4.24AsnAla: 4.24 ± 0.678
0.303AsnCys: 0.303 ± 0.162
2.423AsnAsp: 2.423 ± 0.51
4.24AsnGlu: 4.24 ± 0.809
1.413AsnPhe: 1.413 ± 0.364
5.249AsnGly: 5.249 ± 0.899
0.707AsnHis: 0.707 ± 0.254
3.634AsnIle: 3.634 ± 0.441
3.634AsnLys: 3.634 ± 0.717
5.653AsnLeu: 5.653 ± 0.875
0.909AsnMet: 0.909 ± 0.244
2.928AsnAsn: 2.928 ± 0.583
2.827AsnPro: 2.827 ± 0.555
2.827AsnGln: 2.827 ± 0.582
1.615AsnArg: 1.615 ± 0.442
3.634AsnSer: 3.634 ± 0.685
3.533AsnThr: 3.533 ± 0.522
4.341AsnVal: 4.341 ± 0.623
1.312AsnTrp: 1.312 ± 0.428
1.615AsnTyr: 1.615 ± 0.36
0.0AsnXaa: 0.0 ± 0.0
Pro
1.615ProAla: 1.615 ± 0.35
0.101ProCys: 0.101 ± 0.108
1.918ProAsp: 1.918 ± 0.445
2.423ProGlu: 2.423 ± 0.657
1.514ProPhe: 1.514 ± 0.465
1.11ProGly: 1.11 ± 0.339
0.505ProHis: 0.505 ± 0.243
2.524ProIle: 2.524 ± 0.519
2.726ProLys: 2.726 ± 0.68
2.221ProLeu: 2.221 ± 0.521
0.707ProMet: 0.707 ± 0.196
1.211ProAsn: 1.211 ± 0.317
0.303ProPro: 0.303 ± 0.163
0.707ProGln: 0.707 ± 0.252
0.808ProArg: 0.808 ± 0.316
2.221ProSer: 2.221 ± 0.459
1.615ProThr: 1.615 ± 0.499
1.009ProVal: 1.009 ± 0.343
0.303ProTrp: 0.303 ± 0.201
0.808ProTyr: 0.808 ± 0.259
0.0ProXaa: 0.0 ± 0.0
Gln
3.937GlnAla: 3.937 ± 0.638
0.303GlnCys: 0.303 ± 0.202
1.817GlnAsp: 1.817 ± 0.405
3.634GlnGlu: 3.634 ± 0.544
1.716GlnPhe: 1.716 ± 0.47
3.028GlnGly: 3.028 ± 0.61
0.808GlnHis: 0.808 ± 0.29
2.625GlnIle: 2.625 ± 0.427
3.432GlnLys: 3.432 ± 0.679
2.322GlnLeu: 2.322 ± 0.552
0.909GlnMet: 0.909 ± 0.387
2.726GlnAsn: 2.726 ± 0.616
1.312GlnPro: 1.312 ± 0.266
0.909GlnGln: 0.909 ± 0.298
1.817GlnArg: 1.817 ± 0.418
2.423GlnSer: 2.423 ± 0.412
2.019GlnThr: 2.019 ± 0.566
2.524GlnVal: 2.524 ± 0.564
0.101GlnTrp: 0.101 ± 0.091
1.211GlnTyr: 1.211 ± 0.399
0.0GlnXaa: 0.0 ± 0.0
Arg
1.918ArgAla: 1.918 ± 0.476
0.707ArgCys: 0.707 ± 0.255
2.322ArgAsp: 2.322 ± 0.538
2.726ArgGlu: 2.726 ± 0.412
1.817ArgPhe: 1.817 ± 0.415
2.726ArgGly: 2.726 ± 0.683
0.909ArgHis: 0.909 ± 0.322
3.028ArgIle: 3.028 ± 0.534
5.148ArgLys: 5.148 ± 0.99
3.836ArgLeu: 3.836 ± 0.746
0.808ArgMet: 0.808 ± 0.269
2.524ArgAsn: 2.524 ± 0.504
1.009ArgPro: 1.009 ± 0.329
1.413ArgGln: 1.413 ± 0.316
1.615ArgArg: 1.615 ± 0.531
1.817ArgSer: 1.817 ± 0.412
2.423ArgThr: 2.423 ± 0.549
2.524ArgVal: 2.524 ± 0.541
0.707ArgTrp: 0.707 ± 0.27
2.12ArgTyr: 2.12 ± 0.582
0.0ArgXaa: 0.0 ± 0.0
Ser
3.836SerAla: 3.836 ± 1.366
0.505SerCys: 0.505 ± 0.251
3.937SerAsp: 3.937 ± 0.663
4.846SerGlu: 4.846 ± 0.852
2.423SerPhe: 2.423 ± 0.623
5.35SerGly: 5.35 ± 0.85
1.211SerHis: 1.211 ± 0.405
4.442SerIle: 4.442 ± 0.652
4.038SerLys: 4.038 ± 0.59
5.451SerLeu: 5.451 ± 0.715
1.918SerMet: 1.918 ± 0.562
3.634SerAsn: 3.634 ± 0.548
2.221SerPro: 2.221 ± 0.435
3.028SerGln: 3.028 ± 0.797
2.221SerArg: 2.221 ± 0.607
3.129SerSer: 3.129 ± 0.495
3.331SerThr: 3.331 ± 0.474
4.038SerVal: 4.038 ± 0.764
0.505SerTrp: 0.505 ± 0.206
3.129SerTyr: 3.129 ± 0.619
0.0SerXaa: 0.0 ± 0.0
Thr
4.946ThrAla: 4.946 ± 1.089
0.202ThrCys: 0.202 ± 0.118
3.533ThrAsp: 3.533 ± 0.597
2.827ThrGlu: 2.827 ± 0.539
3.331ThrPhe: 3.331 ± 0.549
4.644ThrGly: 4.644 ± 0.973
0.909ThrHis: 0.909 ± 0.276
4.442ThrIle: 4.442 ± 0.639
3.432ThrLys: 3.432 ± 0.443
5.249ThrLeu: 5.249 ± 0.867
1.009ThrMet: 1.009 ± 0.312
3.432ThrAsn: 3.432 ± 0.542
1.211ThrPro: 1.211 ± 0.451
1.918ThrGln: 1.918 ± 0.437
2.322ThrArg: 2.322 ± 0.567
3.23ThrSer: 3.23 ± 0.64
3.432ThrThr: 3.432 ± 0.639
4.543ThrVal: 4.543 ± 0.64
0.707ThrTrp: 0.707 ± 0.21
1.514ThrTyr: 1.514 ± 0.348
0.0ThrXaa: 0.0 ± 0.0
Val
4.341ValAla: 4.341 ± 0.765
0.404ValCys: 0.404 ± 0.189
3.735ValAsp: 3.735 ± 0.699
4.24ValGlu: 4.24 ± 0.722
1.918ValPhe: 1.918 ± 0.418
3.533ValGly: 3.533 ± 0.511
1.009ValHis: 1.009 ± 0.323
4.038ValIle: 4.038 ± 0.742
5.451ValLys: 5.451 ± 0.679
5.249ValLeu: 5.249 ± 0.842
0.707ValMet: 0.707 ± 0.247
3.129ValAsn: 3.129 ± 0.576
1.817ValPro: 1.817 ± 0.449
2.524ValGln: 2.524 ± 0.619
2.827ValArg: 2.827 ± 0.502
4.644ValSer: 4.644 ± 0.555
4.442ValThr: 4.442 ± 0.718
4.24ValVal: 4.24 ± 0.677
1.11ValTrp: 1.11 ± 0.619
2.019ValTyr: 2.019 ± 0.381
0.0ValXaa: 0.0 ± 0.0
Trp
0.909TrpAla: 0.909 ± 0.325
0.101TrpCys: 0.101 ± 0.096
1.11TrpAsp: 1.11 ± 0.458
1.11TrpGlu: 1.11 ± 0.337
0.707TrpPhe: 0.707 ± 0.259
1.009TrpGly: 1.009 ± 0.341
0.202TrpHis: 0.202 ± 0.145
1.009TrpIle: 1.009 ± 0.357
0.909TrpLys: 0.909 ± 0.329
0.707TrpLeu: 0.707 ± 0.217
0.202TrpMet: 0.202 ± 0.145
1.312TrpAsn: 1.312 ± 0.775
0.202TrpPro: 0.202 ± 0.124
0.707TrpGln: 0.707 ± 0.211
0.808TrpArg: 0.808 ± 0.259
0.808TrpSer: 0.808 ± 0.257
0.505TrpThr: 0.505 ± 0.207
0.808TrpVal: 0.808 ± 0.239
0.101TrpTrp: 0.101 ± 0.088
0.202TrpTyr: 0.202 ± 0.124
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.524TyrAla: 2.524 ± 0.533
0.303TyrCys: 0.303 ± 0.193
2.221TyrAsp: 2.221 ± 0.411
3.533TyrGlu: 3.533 ± 0.603
1.817TyrPhe: 1.817 ± 0.47
2.221TyrGly: 2.221 ± 0.442
1.211TyrHis: 1.211 ± 0.353
2.524TyrIle: 2.524 ± 0.526
2.726TyrLys: 2.726 ± 0.607
3.23TyrLeu: 3.23 ± 0.494
0.505TyrMet: 0.505 ± 0.19
2.12TyrAsn: 2.12 ± 0.414
1.009TyrPro: 1.009 ± 0.366
2.322TyrGln: 2.322 ± 0.439
2.322TyrArg: 2.322 ± 0.529
2.524TyrSer: 2.524 ± 0.681
1.514TyrThr: 1.514 ± 0.316
1.413TyrVal: 1.413 ± 0.372
0.303TyrTrp: 0.303 ± 0.158
0.707TyrTyr: 0.707 ± 0.255
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (9907 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski