Amino acid dipepetide frequency for Escherichia phage PDX

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.996AlaAla: 6.996 ± 1.542
0.451AlaCys: 0.451 ± 0.326
4.965AlaAsp: 4.965 ± 0.688
5.191AlaGlu: 5.191 ± 1.358
2.257AlaPhe: 2.257 ± 0.616
6.996AlaGly: 6.996 ± 1.82
1.58AlaHis: 1.58 ± 0.386
3.16AlaIle: 3.16 ± 0.767
5.642AlaLys: 5.642 ± 1.224
5.642AlaLeu: 5.642 ± 1.097
2.708AlaMet: 2.708 ± 0.952
5.416AlaAsn: 5.416 ± 1.426
3.385AlaPro: 3.385 ± 0.805
3.385AlaGln: 3.385 ± 1.039
3.16AlaArg: 3.16 ± 0.8
4.739AlaSer: 4.739 ± 0.957
5.191AlaThr: 5.191 ± 0.595
4.965AlaVal: 4.965 ± 0.86
1.354AlaTrp: 1.354 ± 0.422
2.934AlaTyr: 2.934 ± 0.766
0.0AlaXaa: 0.0 ± 0.0
Cys
1.354CysAla: 1.354 ± 0.593
0.226CysCys: 0.226 ± 0.162
0.0CysAsp: 0.0 ± 0.0
1.128CysGlu: 1.128 ± 0.897
0.226CysPhe: 0.226 ± 0.162
0.677CysGly: 0.677 ± 0.319
0.677CysHis: 0.677 ± 0.347
0.0CysIle: 0.0 ± 0.0
0.226CysLys: 0.226 ± 0.236
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.677CysAsn: 0.677 ± 0.546
0.451CysPro: 0.451 ± 0.346
0.451CysGln: 0.451 ± 0.389
1.354CysArg: 1.354 ± 0.497
0.226CysSer: 0.226 ± 0.278
0.451CysThr: 0.451 ± 0.246
0.677CysVal: 0.677 ± 0.294
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.514AspAla: 4.514 ± 0.643
0.451AspCys: 0.451 ± 0.246
4.514AspAsp: 4.514 ± 1.303
3.385AspGlu: 3.385 ± 0.657
2.708AspPhe: 2.708 ± 0.564
6.093AspGly: 6.093 ± 1.269
0.903AspHis: 0.903 ± 0.545
2.257AspIle: 2.257 ± 0.655
4.965AspLys: 4.965 ± 1.437
5.191AspLeu: 5.191 ± 0.849
2.031AspMet: 2.031 ± 0.939
2.934AspAsn: 2.934 ± 0.582
4.062AspPro: 4.062 ± 0.937
1.128AspGln: 1.128 ± 0.398
2.257AspArg: 2.257 ± 0.764
3.837AspSer: 3.837 ± 1.25
3.837AspThr: 3.837 ± 0.759
5.868AspVal: 5.868 ± 0.618
1.58AspTrp: 1.58 ± 0.432
3.16AspTyr: 3.16 ± 0.437
0.0AspXaa: 0.0 ± 0.0
Glu
4.965GluAla: 4.965 ± 0.799
0.451GluCys: 0.451 ± 0.272
5.416GluAsp: 5.416 ± 0.789
4.288GluGlu: 4.288 ± 0.982
2.031GluPhe: 2.031 ± 0.527
4.965GluGly: 4.965 ± 1.013
1.58GluHis: 1.58 ± 0.604
2.483GluIle: 2.483 ± 0.593
2.708GluLys: 2.708 ± 0.856
5.416GluLeu: 5.416 ± 1.408
1.58GluMet: 1.58 ± 0.981
2.483GluAsn: 2.483 ± 0.867
1.354GluPro: 1.354 ± 0.551
3.385GluGln: 3.385 ± 0.671
2.934GluArg: 2.934 ± 0.796
3.611GluSer: 3.611 ± 0.632
2.257GluThr: 2.257 ± 1.053
4.514GluVal: 4.514 ± 0.909
0.903GluTrp: 0.903 ± 0.355
1.805GluTyr: 1.805 ± 0.614
0.0GluXaa: 0.0 ± 0.0
Phe
2.483PheAla: 2.483 ± 0.699
0.0PheCys: 0.0 ± 0.0
1.805PheAsp: 1.805 ± 0.877
2.257PheGlu: 2.257 ± 0.905
0.903PhePhe: 0.903 ± 0.615
2.483PheGly: 2.483 ± 0.558
0.226PheHis: 0.226 ± 0.231
1.128PheIle: 1.128 ± 0.573
3.385PheLys: 3.385 ± 0.545
2.257PheLeu: 2.257 ± 0.733
0.451PheMet: 0.451 ± 0.271
1.128PheAsn: 1.128 ± 0.367
1.354PhePro: 1.354 ± 0.516
0.903PheGln: 0.903 ± 0.432
2.483PheArg: 2.483 ± 0.546
2.257PheSer: 2.257 ± 0.7
2.934PheThr: 2.934 ± 0.525
1.805PheVal: 1.805 ± 0.583
0.677PheTrp: 0.677 ± 0.322
2.483PheTyr: 2.483 ± 1.048
0.0PheXaa: 0.0 ± 0.0
Gly
6.996GlyAla: 6.996 ± 1.504
0.451GlyCys: 0.451 ± 0.346
6.545GlyAsp: 6.545 ± 1.091
3.16GlyGlu: 3.16 ± 1.106
3.16GlyPhe: 3.16 ± 0.802
6.996GlyGly: 6.996 ± 1.786
1.354GlyHis: 1.354 ± 0.589
5.642GlyIle: 5.642 ± 1.097
4.062GlyLys: 4.062 ± 0.789
4.288GlyLeu: 4.288 ± 0.779
2.483GlyMet: 2.483 ± 0.654
4.965GlyAsn: 4.965 ± 1.478
2.031GlyPro: 2.031 ± 0.919
2.934GlyGln: 2.934 ± 0.452
2.934GlyArg: 2.934 ± 0.829
4.288GlySer: 4.288 ± 1.012
7.899GlyThr: 7.899 ± 2.612
6.996GlyVal: 6.996 ± 1.338
0.903GlyTrp: 0.903 ± 0.285
4.514GlyTyr: 4.514 ± 1.326
0.0GlyXaa: 0.0 ± 0.0
His
1.128HisAla: 1.128 ± 0.631
0.226HisCys: 0.226 ± 0.173
0.677HisAsp: 0.677 ± 0.372
1.805HisGlu: 1.805 ± 0.729
0.903HisPhe: 0.903 ± 0.379
1.128HisGly: 1.128 ± 0.472
0.677HisHis: 0.677 ± 0.373
0.903HisIle: 0.903 ± 0.379
1.805HisLys: 1.805 ± 0.578
2.031HisLeu: 2.031 ± 0.495
0.451HisMet: 0.451 ± 0.193
1.128HisAsn: 1.128 ± 0.644
0.903HisPro: 0.903 ± 0.437
0.677HisGln: 0.677 ± 0.534
1.354HisArg: 1.354 ± 0.444
1.128HisSer: 1.128 ± 0.535
1.805HisThr: 1.805 ± 0.416
1.128HisVal: 1.128 ± 0.475
0.677HisTrp: 0.677 ± 0.525
1.354HisTyr: 1.354 ± 0.669
0.0HisXaa: 0.0 ± 0.0
Ile
6.319IleAla: 6.319 ± 1.417
0.677IleCys: 0.677 ± 0.818
3.611IleAsp: 3.611 ± 1.196
5.191IleGlu: 5.191 ± 1.331
1.354IlePhe: 1.354 ± 0.486
3.837IleGly: 3.837 ± 0.612
1.128IleHis: 1.128 ± 0.544
2.257IleIle: 2.257 ± 0.521
3.837IleLys: 3.837 ± 1.481
1.805IleLeu: 1.805 ± 0.733
0.677IleMet: 0.677 ± 0.324
3.837IleAsn: 3.837 ± 1.213
2.934IlePro: 2.934 ± 0.505
0.903IleGln: 0.903 ± 0.52
2.934IleArg: 2.934 ± 0.493
3.385IleSer: 3.385 ± 1.025
4.288IleThr: 4.288 ± 1.05
4.062IleVal: 4.062 ± 0.909
0.226IleTrp: 0.226 ± 0.246
2.257IleTyr: 2.257 ± 0.383
0.0IleXaa: 0.0 ± 0.0
Lys
5.642LysAla: 5.642 ± 0.881
0.677LysCys: 0.677 ± 0.323
4.514LysAsp: 4.514 ± 1.211
3.837LysGlu: 3.837 ± 1.193
2.483LysPhe: 2.483 ± 0.732
4.288LysGly: 4.288 ± 0.617
1.354LysHis: 1.354 ± 0.623
4.288LysIle: 4.288 ± 0.877
2.934LysLys: 2.934 ± 1.151
3.837LysLeu: 3.837 ± 1.222
1.58LysMet: 1.58 ± 0.604
3.385LysAsn: 3.385 ± 1.465
1.354LysPro: 1.354 ± 0.9
2.708LysGln: 2.708 ± 0.82
1.805LysArg: 1.805 ± 0.542
2.708LysSer: 2.708 ± 0.511
3.611LysThr: 3.611 ± 1.374
4.514LysVal: 4.514 ± 1.242
0.903LysTrp: 0.903 ± 0.335
2.257LysTyr: 2.257 ± 0.539
0.0LysXaa: 0.0 ± 0.0
Leu
4.288LeuAla: 4.288 ± 0.851
1.58LeuCys: 1.58 ± 0.663
4.739LeuAsp: 4.739 ± 0.951
5.416LeuGlu: 5.416 ± 1.34
2.031LeuPhe: 2.031 ± 0.62
4.288LeuGly: 4.288 ± 1.027
3.611LeuHis: 3.611 ± 1.101
4.062LeuIle: 4.062 ± 0.9
4.514LeuLys: 4.514 ± 1.18
2.031LeuLeu: 2.031 ± 0.944
1.128LeuMet: 1.128 ± 0.373
3.611LeuAsn: 3.611 ± 0.536
2.708LeuPro: 2.708 ± 0.737
3.385LeuGln: 3.385 ± 0.901
3.611LeuArg: 3.611 ± 0.951
3.16LeuSer: 3.16 ± 0.829
5.642LeuThr: 5.642 ± 1.54
3.385LeuVal: 3.385 ± 1.132
0.677LeuTrp: 0.677 ± 0.417
2.708LeuTyr: 2.708 ± 0.551
0.0LeuXaa: 0.0 ± 0.0
Met
1.58MetAla: 1.58 ± 0.617
0.0MetCys: 0.0 ± 0.0
1.58MetAsp: 1.58 ± 0.408
1.805MetGlu: 1.805 ± 0.484
1.128MetPhe: 1.128 ± 0.683
1.805MetGly: 1.805 ± 0.74
0.903MetHis: 0.903 ± 0.321
1.58MetIle: 1.58 ± 0.387
2.257MetLys: 2.257 ± 0.79
0.677MetLeu: 0.677 ± 0.335
0.226MetMet: 0.226 ± 0.162
1.128MetAsn: 1.128 ± 0.314
2.031MetPro: 2.031 ± 0.439
0.451MetGln: 0.451 ± 0.346
0.677MetArg: 0.677 ± 0.446
1.128MetSer: 1.128 ± 0.356
2.708MetThr: 2.708 ± 0.51
2.708MetVal: 2.708 ± 0.78
0.451MetTrp: 0.451 ± 0.309
0.903MetTyr: 0.903 ± 0.274
0.0MetXaa: 0.0 ± 0.0
Asn
4.288AsnAla: 4.288 ± 0.833
0.677AsnCys: 0.677 ± 0.33
3.16AsnAsp: 3.16 ± 0.49
1.128AsnGlu: 1.128 ± 0.31
2.934AsnPhe: 2.934 ± 0.783
7.222AsnGly: 7.222 ± 1.834
0.677AsnHis: 0.677 ± 0.516
3.16AsnIle: 3.16 ± 0.788
2.483AsnLys: 2.483 ± 1.129
4.288AsnLeu: 4.288 ± 0.807
1.354AsnMet: 1.354 ± 0.475
5.868AsnAsn: 5.868 ± 1.773
2.031AsnPro: 2.031 ± 0.654
2.257AsnGln: 2.257 ± 0.55
1.58AsnArg: 1.58 ± 0.639
2.257AsnSer: 2.257 ± 0.826
4.739AsnThr: 4.739 ± 0.471
3.837AsnVal: 3.837 ± 0.576
0.903AsnTrp: 0.903 ± 0.32
1.58AsnTyr: 1.58 ± 0.519
0.0AsnXaa: 0.0 ± 0.0
Pro
3.16ProAla: 3.16 ± 1.052
0.0ProCys: 0.0 ± 0.0
1.805ProAsp: 1.805 ± 0.581
2.257ProGlu: 2.257 ± 0.532
1.128ProPhe: 1.128 ± 0.315
2.031ProGly: 2.031 ± 0.615
0.451ProHis: 0.451 ± 0.218
2.483ProIle: 2.483 ± 0.829
2.257ProLys: 2.257 ± 0.69
2.934ProLeu: 2.934 ± 0.956
2.257ProMet: 2.257 ± 0.727
1.805ProAsn: 1.805 ± 0.57
1.128ProPro: 1.128 ± 0.696
2.483ProGln: 2.483 ± 0.783
1.58ProArg: 1.58 ± 0.367
2.934ProSer: 2.934 ± 0.754
3.16ProThr: 3.16 ± 0.703
2.257ProVal: 2.257 ± 0.803
0.677ProTrp: 0.677 ± 0.319
2.031ProTyr: 2.031 ± 0.45
0.0ProXaa: 0.0 ± 0.0
Gln
3.611GlnAla: 3.611 ± 0.498
0.226GlnCys: 0.226 ± 0.173
3.611GlnAsp: 3.611 ± 0.83
2.483GlnGlu: 2.483 ± 0.879
0.903GlnPhe: 0.903 ± 0.358
3.16GlnGly: 3.16 ± 1.357
0.451GlnHis: 0.451 ± 0.272
1.805GlnIle: 1.805 ± 0.667
2.031GlnLys: 2.031 ± 1.006
2.934GlnLeu: 2.934 ± 0.6
0.903GlnMet: 0.903 ± 0.532
2.031GlnAsn: 2.031 ± 0.872
2.031GlnPro: 2.031 ± 0.697
2.257GlnGln: 2.257 ± 1.83
1.354GlnArg: 1.354 ± 0.328
2.031GlnSer: 2.031 ± 0.688
0.903GlnThr: 0.903 ± 0.691
2.934GlnVal: 2.934 ± 0.901
0.903GlnTrp: 0.903 ± 0.436
1.354GlnTyr: 1.354 ± 0.664
0.0GlnXaa: 0.0 ± 0.0
Arg
2.483ArgAla: 2.483 ± 0.639
0.0ArgCys: 0.0 ± 0.0
2.031ArgAsp: 2.031 ± 0.758
2.934ArgGlu: 2.934 ± 0.97
1.805ArgPhe: 1.805 ± 0.532
3.16ArgGly: 3.16 ± 0.633
0.903ArgHis: 0.903 ± 0.418
3.385ArgIle: 3.385 ± 0.62
3.385ArgLys: 3.385 ± 0.672
2.934ArgLeu: 2.934 ± 1.32
0.677ArgMet: 0.677 ± 0.495
2.257ArgAsn: 2.257 ± 0.569
1.128ArgPro: 1.128 ± 0.566
1.354ArgGln: 1.354 ± 0.4
2.934ArgArg: 2.934 ± 1.602
2.934ArgSer: 2.934 ± 0.564
2.257ArgThr: 2.257 ± 0.854
4.739ArgVal: 4.739 ± 1.256
0.677ArgTrp: 0.677 ± 0.39
1.805ArgTyr: 1.805 ± 0.847
0.0ArgXaa: 0.0 ± 0.0
Ser
4.514SerAla: 4.514 ± 0.824
0.451SerCys: 0.451 ± 0.325
2.257SerAsp: 2.257 ± 0.338
3.16SerGlu: 3.16 ± 0.58
2.708SerPhe: 2.708 ± 0.688
4.965SerGly: 4.965 ± 1.034
1.354SerHis: 1.354 ± 0.447
3.611SerIle: 3.611 ± 0.968
2.708SerLys: 2.708 ± 0.501
4.514SerLeu: 4.514 ± 1.224
1.805SerMet: 1.805 ± 0.481
2.708SerAsn: 2.708 ± 1.052
1.805SerPro: 1.805 ± 0.494
2.257SerGln: 2.257 ± 0.673
2.257SerArg: 2.257 ± 0.733
4.965SerSer: 4.965 ± 0.979
3.837SerThr: 3.837 ± 1.106
4.739SerVal: 4.739 ± 1.223
0.226SerTrp: 0.226 ± 0.173
1.354SerTyr: 1.354 ± 0.459
0.0SerXaa: 0.0 ± 0.0
Thr
6.319ThrAla: 6.319 ± 1.498
0.226ThrCys: 0.226 ± 0.278
3.385ThrAsp: 3.385 ± 0.597
3.837ThrGlu: 3.837 ± 0.934
2.031ThrPhe: 2.031 ± 0.673
7.899ThrGly: 7.899 ± 1.335
0.903ThrHis: 0.903 ± 0.37
4.062ThrIle: 4.062 ± 1.197
2.483ThrLys: 2.483 ± 0.89
5.868ThrLeu: 5.868 ± 1.525
1.354ThrMet: 1.354 ± 0.486
3.16ThrAsn: 3.16 ± 0.862
3.16ThrPro: 3.16 ± 0.783
2.483ThrGln: 2.483 ± 0.795
2.934ThrArg: 2.934 ± 1.184
3.16ThrSer: 3.16 ± 0.955
6.093ThrThr: 6.093 ± 1.828
8.125ThrVal: 8.125 ± 1.254
1.128ThrTrp: 1.128 ± 0.369
3.837ThrTyr: 3.837 ± 0.852
0.0ThrXaa: 0.0 ± 0.0
Val
4.062ValAla: 4.062 ± 1.08
0.903ValCys: 0.903 ± 0.287
7.899ValAsp: 7.899 ± 2.002
3.385ValGlu: 3.385 ± 0.701
1.58ValPhe: 1.58 ± 0.471
6.093ValGly: 6.093 ± 1.008
1.805ValHis: 1.805 ± 0.449
6.093ValIle: 6.093 ± 1.047
5.191ValLys: 5.191 ± 0.647
5.416ValLeu: 5.416 ± 1.17
1.805ValMet: 1.805 ± 0.494
4.739ValAsn: 4.739 ± 1.169
1.805ValPro: 1.805 ± 0.473
2.257ValGln: 2.257 ± 0.862
3.16ValArg: 3.16 ± 0.687
3.385ValSer: 3.385 ± 0.547
5.868ValThr: 5.868 ± 1.344
7.222ValVal: 7.222 ± 1.095
1.58ValTrp: 1.58 ± 0.776
2.934ValTyr: 2.934 ± 0.684
0.0ValXaa: 0.0 ± 0.0
Trp
0.903TrpAla: 0.903 ± 0.345
0.226TrpCys: 0.226 ± 0.263
0.451TrpAsp: 0.451 ± 0.193
1.58TrpGlu: 1.58 ± 0.461
0.0TrpPhe: 0.0 ± 0.0
1.805TrpGly: 1.805 ± 0.987
0.451TrpHis: 0.451 ± 0.346
1.354TrpIle: 1.354 ± 0.739
0.226TrpLys: 0.226 ± 0.173
0.903TrpLeu: 0.903 ± 0.298
0.903TrpMet: 0.903 ± 0.394
0.903TrpAsn: 0.903 ± 0.502
0.226TrpPro: 0.226 ± 0.231
1.128TrpGln: 1.128 ± 0.544
0.903TrpArg: 0.903 ± 0.635
0.451TrpSer: 0.451 ± 0.353
1.354TrpThr: 1.354 ± 0.422
0.226TrpVal: 0.226 ± 0.173
0.451TrpTrp: 0.451 ± 0.381
1.128TrpTyr: 1.128 ± 0.279
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.514TyrAla: 4.514 ± 0.626
0.903TyrCys: 0.903 ± 0.533
2.708TyrAsp: 2.708 ± 0.886
0.903TyrGlu: 0.903 ± 0.446
1.128TyrPhe: 1.128 ± 0.313
2.708TyrGly: 2.708 ± 0.539
0.903TyrHis: 0.903 ± 0.49
1.805TyrIle: 1.805 ± 0.465
1.58TyrLys: 1.58 ± 0.438
3.611TyrLeu: 3.611 ± 0.925
1.128TyrMet: 1.128 ± 0.377
2.257TyrAsn: 2.257 ± 0.537
2.934TyrPro: 2.934 ± 0.654
1.128TyrGln: 1.128 ± 0.699
1.58TyrArg: 1.58 ± 0.583
3.837TyrSer: 3.837 ± 0.934
3.611TyrThr: 3.611 ± 0.888
2.483TyrVal: 2.483 ± 0.915
0.677TyrTrp: 0.677 ± 0.312
3.16TyrTyr: 3.16 ± 0.788
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (4432 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski