Amino acid dipepetide frequency for Streptococcus phage phi-SC181

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.928AlaAla: 2.928 ± 0.465
0.502AlaCys: 0.502 ± 0.193
3.597AlaAsp: 3.597 ± 0.565
4.099AlaGlu: 4.099 ± 0.619
3.43AlaPhe: 3.43 ± 0.438
4.434AlaGly: 4.434 ± 0.845
0.502AlaHis: 0.502 ± 0.234
5.271AlaIle: 5.271 ± 0.878
4.852AlaLys: 4.852 ± 0.525
5.103AlaLeu: 5.103 ± 0.701
2.092AlaMet: 2.092 ± 0.439
3.179AlaAsn: 3.179 ± 0.628
1.422AlaPro: 1.422 ± 0.331
1.59AlaGln: 1.59 ± 0.279
2.928AlaArg: 2.928 ± 0.44
4.434AlaSer: 4.434 ± 0.982
3.681AlaThr: 3.681 ± 0.554
3.765AlaVal: 3.765 ± 0.628
0.418AlaTrp: 0.418 ± 0.22
3.346AlaTyr: 3.346 ± 0.428
0.0AlaXaa: 0.0 ± 0.0
Cys
0.753CysAla: 0.753 ± 0.32
0.418CysCys: 0.418 ± 0.17
0.669CysAsp: 0.669 ± 0.21
0.502CysGlu: 0.502 ± 0.173
0.251CysPhe: 0.251 ± 0.138
0.92CysGly: 0.92 ± 0.184
0.167CysHis: 0.167 ± 0.123
0.753CysIle: 0.753 ± 0.24
0.586CysLys: 0.586 ± 0.228
0.753CysLeu: 0.753 ± 0.369
0.084CysMet: 0.084 ± 0.075
0.418CysAsn: 0.418 ± 0.164
0.418CysPro: 0.418 ± 0.21
0.92CysGln: 0.92 ± 0.323
0.502CysArg: 0.502 ± 0.237
0.669CysSer: 0.669 ± 0.243
0.251CysThr: 0.251 ± 0.183
0.669CysVal: 0.669 ± 0.239
0.0CysTrp: 0.0 ± 0.0
0.837CysTyr: 0.837 ± 0.329
0.0CysXaa: 0.0 ± 0.0
Asp
3.012AspAla: 3.012 ± 0.515
0.586AspCys: 0.586 ± 0.257
3.597AspAsp: 3.597 ± 0.657
5.856AspGlu: 5.856 ± 0.876
3.932AspPhe: 3.932 ± 0.528
4.35AspGly: 4.35 ± 0.743
1.171AspHis: 1.171 ± 0.361
4.685AspIle: 4.685 ± 0.623
3.848AspLys: 3.848 ± 0.482
5.02AspLeu: 5.02 ± 0.949
1.506AspMet: 1.506 ± 0.423
3.346AspAsn: 3.346 ± 0.455
1.757AspPro: 1.757 ± 0.45
1.673AspGln: 1.673 ± 0.298
2.928AspArg: 2.928 ± 0.621
3.43AspSer: 3.43 ± 0.661
3.095AspThr: 3.095 ± 0.496
2.844AspVal: 2.844 ± 0.668
0.92AspTrp: 0.92 ± 0.243
3.765AspTyr: 3.765 ± 0.723
0.0AspXaa: 0.0 ± 0.0
Glu
3.765GluAla: 3.765 ± 0.444
0.753GluCys: 0.753 ± 0.24
4.434GluAsp: 4.434 ± 0.572
4.518GluGlu: 4.518 ± 0.651
2.761GluPhe: 2.761 ± 0.576
4.769GluGly: 4.769 ± 0.502
1.004GluHis: 1.004 ± 0.363
4.936GluIle: 4.936 ± 0.774
6.275GluLys: 6.275 ± 1.171
7.529GluLeu: 7.529 ± 0.627
1.924GluMet: 1.924 ± 0.606
4.099GluAsn: 4.099 ± 0.786
1.757GluPro: 1.757 ± 0.472
4.601GluGln: 4.601 ± 0.482
2.593GluArg: 2.593 ± 0.508
3.012GluSer: 3.012 ± 0.352
3.514GluThr: 3.514 ± 0.523
4.434GluVal: 4.434 ± 0.568
1.004GluTrp: 1.004 ± 0.311
2.928GluTyr: 2.928 ± 0.962
0.0GluXaa: 0.0 ± 0.0
Phe
2.593PheAla: 2.593 ± 0.594
0.669PheCys: 0.669 ± 0.235
2.51PheAsp: 2.51 ± 0.486
3.095PheGlu: 3.095 ± 0.494
1.339PhePhe: 1.339 ± 0.355
2.51PheGly: 2.51 ± 0.447
1.004PheHis: 1.004 ± 0.29
2.426PheIle: 2.426 ± 0.493
3.179PheLys: 3.179 ± 0.622
2.761PheLeu: 2.761 ± 0.568
0.753PheMet: 0.753 ± 0.244
2.677PheAsn: 2.677 ± 0.441
1.171PhePro: 1.171 ± 0.326
1.422PheGln: 1.422 ± 0.439
2.008PheArg: 2.008 ± 0.384
2.343PheSer: 2.343 ± 0.453
2.343PheThr: 2.343 ± 0.541
2.092PheVal: 2.092 ± 0.441
0.418PheTrp: 0.418 ± 0.207
2.761PheTyr: 2.761 ± 0.601
0.0PheXaa: 0.0 ± 0.0
Gly
3.179GlyAla: 3.179 ± 0.508
0.418GlyCys: 0.418 ± 0.163
3.514GlyAsp: 3.514 ± 0.578
4.016GlyGlu: 4.016 ± 0.63
2.677GlyPhe: 2.677 ± 0.416
3.681GlyGly: 3.681 ± 0.848
1.757GlyHis: 1.757 ± 0.417
5.354GlyIle: 5.354 ± 0.629
5.522GlyLys: 5.522 ± 0.615
5.856GlyLeu: 5.856 ± 0.863
1.924GlyMet: 1.924 ± 0.473
2.928GlyAsn: 2.928 ± 0.571
0.669GlyPro: 0.669 ± 0.231
2.928GlyGln: 2.928 ± 0.517
3.681GlyArg: 3.681 ± 0.492
3.681GlySer: 3.681 ± 0.492
4.016GlyThr: 4.016 ± 0.514
4.685GlyVal: 4.685 ± 0.743
0.502GlyTrp: 0.502 ± 0.171
3.932GlyTyr: 3.932 ± 0.562
0.0GlyXaa: 0.0 ± 0.0
His
0.92HisAla: 0.92 ± 0.235
0.335HisCys: 0.335 ± 0.157
1.171HisAsp: 1.171 ± 0.338
1.506HisGlu: 1.506 ± 0.408
0.669HisPhe: 0.669 ± 0.243
1.59HisGly: 1.59 ± 0.323
0.753HisHis: 0.753 ± 0.251
1.171HisIle: 1.171 ± 0.328
0.669HisLys: 0.669 ± 0.207
1.59HisLeu: 1.59 ± 0.287
0.502HisMet: 0.502 ± 0.215
0.837HisAsn: 0.837 ± 0.266
0.837HisPro: 0.837 ± 0.295
1.088HisGln: 1.088 ± 0.397
0.837HisArg: 0.837 ± 0.332
0.669HisSer: 0.669 ± 0.264
1.506HisThr: 1.506 ± 0.307
0.92HisVal: 0.92 ± 0.278
0.167HisTrp: 0.167 ± 0.125
1.422HisTyr: 1.422 ± 0.454
0.0HisXaa: 0.0 ± 0.0
Ile
4.936IleAla: 4.936 ± 0.446
0.837IleCys: 0.837 ± 0.235
6.024IleAsp: 6.024 ± 0.529
5.02IleGlu: 5.02 ± 1.123
2.426IlePhe: 2.426 ± 0.61
4.267IleGly: 4.267 ± 0.619
0.753IleHis: 0.753 ± 0.233
4.434IleIle: 4.434 ± 0.728
4.601IleLys: 4.601 ± 0.708
5.271IleLeu: 5.271 ± 0.584
0.92IleMet: 0.92 ± 0.417
2.844IleAsn: 2.844 ± 0.474
1.924IlePro: 1.924 ± 0.306
2.928IleGln: 2.928 ± 0.466
2.761IleArg: 2.761 ± 0.448
5.94IleSer: 5.94 ± 0.921
4.267IleThr: 4.267 ± 0.928
5.103IleVal: 5.103 ± 0.664
1.088IleTrp: 1.088 ± 0.35
2.343IleTyr: 2.343 ± 0.645
0.0IleXaa: 0.0 ± 0.0
Lys
5.856LysAla: 5.856 ± 0.635
0.669LysCys: 0.669 ± 0.186
3.179LysAsp: 3.179 ± 0.603
5.438LysGlu: 5.438 ± 0.68
2.844LysPhe: 2.844 ± 0.488
4.518LysGly: 4.518 ± 0.544
1.841LysHis: 1.841 ± 0.354
5.187LysIle: 5.187 ± 0.626
6.024LysLys: 6.024 ± 0.903
6.107LysLeu: 6.107 ± 0.792
2.175LysMet: 2.175 ± 0.594
2.677LysAsn: 2.677 ± 0.563
2.51LysPro: 2.51 ± 0.511
2.761LysGln: 2.761 ± 0.482
4.35LysArg: 4.35 ± 0.552
3.765LysSer: 3.765 ± 0.546
4.434LysThr: 4.434 ± 0.437
5.605LysVal: 5.605 ± 0.768
1.255LysTrp: 1.255 ± 0.314
2.761LysTyr: 2.761 ± 0.525
0.0LysXaa: 0.0 ± 0.0
Leu
4.936LeuAla: 4.936 ± 0.645
0.586LeuCys: 0.586 ± 0.253
5.522LeuAsp: 5.522 ± 0.626
7.028LeuGlu: 7.028 ± 0.875
2.844LeuPhe: 2.844 ± 0.636
5.856LeuGly: 5.856 ± 0.799
1.171LeuHis: 1.171 ± 0.301
4.852LeuIle: 4.852 ± 0.739
7.111LeuLys: 7.111 ± 0.844
7.529LeuLeu: 7.529 ± 1.316
2.092LeuMet: 2.092 ± 0.282
4.183LeuAsn: 4.183 ± 0.537
2.844LeuPro: 2.844 ± 0.516
3.095LeuGln: 3.095 ± 0.442
3.681LeuArg: 3.681 ± 0.665
7.028LeuSer: 7.028 ± 0.837
6.191LeuThr: 6.191 ± 0.549
6.191LeuVal: 6.191 ± 0.665
0.502LeuTrp: 0.502 ± 0.146
3.514LeuTyr: 3.514 ± 0.593
0.0LeuXaa: 0.0 ± 0.0
Met
2.259MetAla: 2.259 ± 0.514
0.167MetCys: 0.167 ± 0.116
1.59MetAsp: 1.59 ± 0.39
1.757MetGlu: 1.757 ± 0.433
0.837MetPhe: 0.837 ± 0.246
1.924MetGly: 1.924 ± 0.471
0.251MetHis: 0.251 ± 0.172
1.506MetIle: 1.506 ± 0.412
1.757MetLys: 1.757 ± 0.363
1.506MetLeu: 1.506 ± 0.421
0.753MetMet: 0.753 ± 0.275
1.088MetAsn: 1.088 ± 0.309
0.753MetPro: 0.753 ± 0.328
0.669MetGln: 0.669 ± 0.263
1.422MetArg: 1.422 ± 0.36
1.506MetSer: 1.506 ± 0.393
1.506MetThr: 1.506 ± 0.396
1.59MetVal: 1.59 ± 0.381
0.167MetTrp: 0.167 ± 0.115
0.586MetTyr: 0.586 ± 0.234
0.0MetXaa: 0.0 ± 0.0
Asn
4.183AsnAla: 4.183 ± 0.647
0.418AsnCys: 0.418 ± 0.189
3.012AsnAsp: 3.012 ± 0.455
2.593AsnGlu: 2.593 ± 0.483
2.008AsnPhe: 2.008 ± 0.401
4.601AsnGly: 4.601 ± 0.638
1.59AsnHis: 1.59 ± 0.376
3.263AsnIle: 3.263 ± 0.457
3.263AsnLys: 3.263 ± 0.448
4.267AsnLeu: 4.267 ± 0.979
0.753AsnMet: 0.753 ± 0.227
3.346AsnAsn: 3.346 ± 0.553
2.175AsnPro: 2.175 ± 0.334
2.259AsnGln: 2.259 ± 0.332
2.092AsnArg: 2.092 ± 0.474
3.095AsnSer: 3.095 ± 0.626
2.51AsnThr: 2.51 ± 0.503
2.343AsnVal: 2.343 ± 0.551
1.255AsnTrp: 1.255 ± 0.369
1.757AsnTyr: 1.757 ± 0.388
0.0AsnXaa: 0.0 ± 0.0
Pro
1.59ProAla: 1.59 ± 0.325
0.418ProCys: 0.418 ± 0.183
1.422ProAsp: 1.422 ± 0.367
2.677ProGlu: 2.677 ± 0.431
1.255ProPhe: 1.255 ± 0.27
0.586ProGly: 0.586 ± 0.238
0.418ProHis: 0.418 ± 0.183
1.757ProIle: 1.757 ± 0.292
2.761ProLys: 2.761 ± 0.465
2.092ProLeu: 2.092 ± 0.376
0.502ProMet: 0.502 ± 0.182
2.092ProAsn: 2.092 ± 0.383
0.586ProPro: 0.586 ± 0.26
1.171ProGln: 1.171 ± 0.415
1.339ProArg: 1.339 ± 0.334
2.593ProSer: 2.593 ± 0.528
2.426ProThr: 2.426 ± 0.428
2.343ProVal: 2.343 ± 0.362
0.418ProTrp: 0.418 ± 0.165
1.422ProTyr: 1.422 ± 0.334
0.0ProXaa: 0.0 ± 0.0
Gln
3.179GlnAla: 3.179 ± 0.564
0.502GlnCys: 0.502 ± 0.214
1.757GlnAsp: 1.757 ± 0.413
2.593GlnGlu: 2.593 ± 0.342
1.924GlnPhe: 1.924 ± 0.362
2.175GlnGly: 2.175 ± 0.494
0.753GlnHis: 0.753 ± 0.188
2.092GlnIle: 2.092 ± 0.512
3.012GlnLys: 3.012 ± 0.581
3.932GlnLeu: 3.932 ± 0.486
1.339GlnMet: 1.339 ± 0.341
2.677GlnAsn: 2.677 ± 0.409
1.171GlnPro: 1.171 ± 0.297
1.924GlnGln: 1.924 ± 0.337
2.092GlnArg: 2.092 ± 0.43
2.51GlnSer: 2.51 ± 0.556
2.928GlnThr: 2.928 ± 0.855
4.267GlnVal: 4.267 ± 0.544
0.92GlnTrp: 0.92 ± 0.3
0.837GlnTyr: 0.837 ± 0.387
0.0GlnXaa: 0.0 ± 0.0
Arg
1.841ArgAla: 1.841 ± 0.465
0.92ArgCys: 0.92 ± 0.27
2.844ArgAsp: 2.844 ± 0.467
2.928ArgGlu: 2.928 ± 0.394
1.673ArgPhe: 1.673 ± 0.394
2.259ArgGly: 2.259 ± 0.401
0.837ArgHis: 0.837 ± 0.272
3.765ArgIle: 3.765 ± 0.552
3.932ArgLys: 3.932 ± 0.771
4.936ArgLeu: 4.936 ± 0.585
0.837ArgMet: 0.837 ± 0.314
2.761ArgAsn: 2.761 ± 0.368
1.422ArgPro: 1.422 ± 0.328
2.51ArgGln: 2.51 ± 0.372
2.175ArgArg: 2.175 ± 0.504
3.263ArgSer: 3.263 ± 0.294
2.844ArgThr: 2.844 ± 0.586
3.263ArgVal: 3.263 ± 0.572
0.92ArgTrp: 0.92 ± 0.327
2.008ArgTyr: 2.008 ± 0.526
0.0ArgXaa: 0.0 ± 0.0
Ser
3.848SerAla: 3.848 ± 0.521
0.586SerCys: 0.586 ± 0.214
3.765SerAsp: 3.765 ± 0.654
4.601SerGlu: 4.601 ± 0.648
2.844SerPhe: 2.844 ± 0.561
5.522SerGly: 5.522 ± 0.717
1.673SerHis: 1.673 ± 0.313
5.438SerIle: 5.438 ± 0.787
3.848SerLys: 3.848 ± 0.605
5.187SerLeu: 5.187 ± 0.751
1.59SerMet: 1.59 ± 0.294
2.092SerAsn: 2.092 ± 0.454
1.841SerPro: 1.841 ± 0.345
2.51SerGln: 2.51 ± 0.858
2.928SerArg: 2.928 ± 0.484
4.35SerSer: 4.35 ± 0.952
4.267SerThr: 4.267 ± 0.565
4.099SerVal: 4.099 ± 0.621
0.753SerTrp: 0.753 ± 0.202
2.51SerTyr: 2.51 ± 0.552
0.0SerXaa: 0.0 ± 0.0
Thr
4.016ThrAla: 4.016 ± 0.707
0.084ThrCys: 0.084 ± 0.086
3.765ThrAsp: 3.765 ± 0.61
4.434ThrGlu: 4.434 ± 0.555
2.259ThrPhe: 2.259 ± 0.342
4.35ThrGly: 4.35 ± 0.766
0.92ThrHis: 0.92 ± 0.281
4.769ThrIle: 4.769 ± 0.686
3.848ThrLys: 3.848 ± 0.584
4.936ThrLeu: 4.936 ± 0.576
1.088ThrMet: 1.088 ± 0.314
2.928ThrAsn: 2.928 ± 0.563
2.761ThrPro: 2.761 ± 0.616
2.175ThrGln: 2.175 ± 0.868
2.593ThrArg: 2.593 ± 0.46
4.601ThrSer: 4.601 ± 1.128
4.434ThrThr: 4.434 ± 0.975
4.936ThrVal: 4.936 ± 0.819
0.669ThrTrp: 0.669 ± 0.288
2.426ThrTyr: 2.426 ± 0.48
0.0ThrXaa: 0.0 ± 0.0
Val
4.434ValAla: 4.434 ± 0.695
0.753ValCys: 0.753 ± 0.293
4.35ValAsp: 4.35 ± 0.64
4.434ValGlu: 4.434 ± 0.743
2.092ValPhe: 2.092 ± 0.451
3.263ValGly: 3.263 ± 0.609
1.171ValHis: 1.171 ± 0.289
4.099ValIle: 4.099 ± 0.566
4.769ValLys: 4.769 ± 0.504
6.693ValLeu: 6.693 ± 0.579
1.255ValMet: 1.255 ± 0.292
3.514ValAsn: 3.514 ± 0.527
2.259ValPro: 2.259 ± 0.441
2.928ValGln: 2.928 ± 0.628
3.848ValArg: 3.848 ± 0.553
3.932ValSer: 3.932 ± 0.663
4.35ValThr: 4.35 ± 0.695
3.681ValVal: 3.681 ± 0.565
1.088ValTrp: 1.088 ± 0.29
2.426ValTyr: 2.426 ± 0.384
0.0ValXaa: 0.0 ± 0.0
Trp
1.004TrpAla: 1.004 ± 0.302
0.167TrpCys: 0.167 ± 0.108
0.669TrpAsp: 0.669 ± 0.225
1.088TrpGlu: 1.088 ± 0.211
0.586TrpPhe: 0.586 ± 0.236
0.669TrpGly: 0.669 ± 0.223
0.418TrpHis: 0.418 ± 0.183
0.502TrpIle: 0.502 ± 0.178
1.088TrpLys: 1.088 ± 0.313
1.422TrpLeu: 1.422 ± 0.267
0.502TrpMet: 0.502 ± 0.15
0.837TrpAsn: 0.837 ± 0.327
0.0TrpPro: 0.0 ± 0.0
0.669TrpGln: 0.669 ± 0.247
0.837TrpArg: 0.837 ± 0.299
0.837TrpSer: 0.837 ± 0.317
1.004TrpThr: 1.004 ± 0.367
0.586TrpVal: 0.586 ± 0.224
0.167TrpTrp: 0.167 ± 0.123
0.084TrpTyr: 0.084 ± 0.09
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.343TyrAla: 2.343 ± 0.433
0.753TyrCys: 0.753 ± 0.282
3.932TyrAsp: 3.932 ± 0.741
2.426TyrGlu: 2.426 ± 0.612
1.506TyrPhe: 1.506 ± 0.365
2.677TyrGly: 2.677 ± 0.637
1.004TyrHis: 1.004 ± 0.392
2.426TyrIle: 2.426 ± 0.623
3.012TyrLys: 3.012 ± 0.652
4.267TyrLeu: 4.267 ± 0.597
0.92TyrMet: 0.92 ± 0.382
2.259TyrAsn: 2.259 ± 0.417
1.59TyrPro: 1.59 ± 0.262
2.761TyrGln: 2.761 ± 0.382
2.343TyrArg: 2.343 ± 0.535
2.593TyrSer: 2.593 ± 0.41
2.426TyrThr: 2.426 ± 0.807
1.924TyrVal: 1.924 ± 0.47
0.502TyrTrp: 0.502 ± 0.189
1.255TyrTyr: 1.255 ± 0.464
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 29 proteins (11954 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski