Amino acid dipeptide frequency for Streptococcus satellite phage Javan229

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 standard dipeptides would be present at a frequency of 1/400 = 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
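As a minimal sketch of how such a table could be computed, the Python snippet below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille` and the toy sequences are hypothetical, and the choice not to count pairs across protein boundaries is an assumption; the page does not state how the database handles this.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins
        # (an assumption of this sketch).
        for a, b in zip(seq, seq[1:]):
            # Anything outside the 20 standard residues is treated as X (Xaa).
            a = a if a in AMINO_ACIDS else "X"
            b = b if b in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy sequences invented for illustration only (not Javan229 proteins).
freqs = dipeptide_per_mille(["MKELLA", "MKKX"])
print(freqs["MK"], freqs["KX"], len(freqs))
```

The one-letter pair "MK" corresponds to MetLys in the three-letter notation used in the table.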

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
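The unit conversion can be illustrated with a short Python sketch that turns per mille values into fractions and percentages and compares them with the uniform expectation of 1/400. The helper names are hypothetical, and using the uniform expectation as the over/under threshold is an assumption about how the page colours its cells; the three sample values are taken from the table.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille if all 400 dipeptides were equally likely


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per mille value relative to the expected value (assumed threshold)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Sample values copied from the table (per mille).
examples = {"GluLeu": 14.758, "AlaCys": 0.447, "AlaAla": 0.0}
for name, value in examples.items():
    print(name, per_mille_to_fraction(value), classify(value))
```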

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error). Rows: first amino acid; columns: second amino acid.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 0.0 ± 0.0 | 0.447 ± 0.463 | 3.578 ± 1.734 | 3.131 ± 1.257 | 3.131 ± 0.846 | 4.919 ± 1.085 | 0.447 ± 0.372 | 8.05 ± 1.776 | 3.131 ± 0.773 | 3.131 ± 1.432 | 1.342 ± 1.172 | 1.789 ± 1.347 | 1.789 ± 0.794 | 0.894 ± 0.454 | 4.919 ± 0.92 | 4.472 ± 1.388 | 5.814 ± 1.441 | 1.342 ± 0.889 | 0.447 ± 0.41 | 5.367 ± 1.279 | 0.0 ± 0.0
Cys | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.447 ± 0.525 | 0.447 ± 0.399 | 0.447 ± 0.576 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.894 ± 0.624 | 1.342 ± 0.755 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.447 ± 0.41 | 0.447 ± 0.528 | 0.894 ± 0.568 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.447 ± 0.372 | 0.0 ± 0.0 | 0.0 ± 0.0
Asp | 1.342 ± 0.728 | 0.0 ± 0.0 | 5.367 ± 1.737 | 6.261 ± 1.745 | 2.683 ± 1.336 | 1.789 ± 0.702 | 0.894 ± 1.056 | 4.919 ± 1.491 | 5.367 ± 1.36 | 6.708 ± 1.696 | 3.131 ± 1.343 | 4.919 ± 1.255 | 1.342 ± 0.942 | 0.0 ± 0.0 | 1.342 ± 0.817 | 2.236 ± 1.159 | 3.131 ± 0.962 | 3.131 ± 0.859 | 0.447 ± 0.372 | 5.814 ± 0.98 | 0.0 ± 0.0
Glu | 3.578 ± 1.991 | 0.447 ± 0.525 | 6.261 ± 2.409 | 10.733 ± 3.491 | 4.025 ± 1.228 | 1.789 ± 0.805 | 1.342 ± 0.839 | 3.578 ± 1.885 | 6.708 ± 2.074 | 14.758 ± 4.078 | 0.894 ± 0.797 | 2.236 ± 0.896 | 1.789 ± 0.799 | 4.472 ± 1.407 | 4.025 ± 1.13 | 2.236 ± 1.018 | 4.025 ± 1.197 | 8.05 ± 2.231 | 0.894 ± 0.545 | 3.131 ± 1.333 | 0.0 ± 0.0
Phe | 0.894 ± 0.648 | 0.894 ± 0.763 | 2.236 ± 1.016 | 2.236 ± 0.698 | 1.789 ± 0.69 | 0.447 ± 0.41 | 0.894 ± 0.67 | 4.025 ± 0.869 | 4.025 ± 1.635 | 4.919 ± 1.867 | 1.342 ± 0.8 | 3.131 ± 0.842 | 0.447 ± 0.372 | 0.894 ± 0.551 | 0.894 ± 0.744 | 4.472 ± 1.06 | 0.894 ± 0.744 | 1.789 ± 0.661 | 0.894 ± 0.579 | 4.472 ± 1.492 | 0.0 ± 0.0
Gly | 1.342 ± 0.795 | 0.447 ± 0.41 | 3.131 ± 1.325 | 3.131 ± 1.207 | 0.894 ± 0.621 | 1.342 ± 0.723 | 0.894 ± 0.615 | 2.236 ± 0.776 | 5.367 ± 1.753 | 4.025 ± 1.111 | 0.447 ± 0.661 | 1.342 ± 0.923 | 0.0 ± 0.0 | 0.894 ± 0.401 | 1.789 ± 0.931 | 1.342 ± 0.836 | 3.131 ± 1.161 | 4.472 ± 1.383 | 0.894 ± 0.616 | 3.131 ± 1.095 | 0.0 ± 0.0
His | 1.342 ± 1.23 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.236 ± 0.914 | 0.447 ± 0.372 | 0.894 ± 0.67 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.342 ± 0.775 | 1.789 ± 0.801 | 0.0 ± 0.0 | 1.342 ± 1.02 | 0.447 ± 0.372 | 0.894 ± 0.744 | 0.447 ± 0.372 | 0.447 ± 0.41 | 0.447 ± 0.41 | 1.342 ± 0.879 | 0.447 ± 0.576 | 1.789 ± 1.02 | 0.0 ± 0.0
Ile | 6.708 ± 2.408 | 0.447 ± 0.576 | 5.814 ± 1.876 | 5.367 ± 1.89 | 4.025 ± 1.593 | 2.236 ± 0.642 | 0.894 ± 0.454 | 4.919 ± 1.049 | 8.945 ± 1.734 | 5.367 ± 1.482 | 2.683 ± 0.891 | 8.945 ± 2.018 | 4.025 ± 1.339 | 3.578 ± 1.338 | 1.789 ± 0.749 | 1.789 ± 1.244 | 5.367 ± 1.441 | 1.342 ± 0.446 | 0.894 ± 0.557 | 2.236 ± 0.848 | 0.0 ± 0.0
Lys | 5.814 ± 1.76 | 0.0 ± 0.0 | 4.919 ± 1.049 | 10.733 ± 1.715 | 1.342 ± 0.655 | 3.131 ± 0.912 | 0.894 ± 0.82 | 8.05 ± 2.107 | 10.733 ± 1.912 | 7.156 ± 1.506 | 1.342 ± 0.648 | 4.919 ± 1.953 | 2.683 ± 1.138 | 6.708 ± 1.675 | 4.919 ± 1.093 | 5.367 ± 1.186 | 6.261 ± 1.279 | 4.472 ± 1.236 | 0.894 ± 0.624 | 5.367 ± 1.317 | 0.0 ± 0.0
Leu | 8.945 ± 1.869 | 0.0 ± 0.0 | 6.708 ± 1.403 | 9.392 ± 1.572 | 4.919 ± 1.152 | 6.708 ± 1.644 | 1.789 ± 0.614 | 5.367 ± 1.711 | 9.392 ± 1.997 | 6.708 ± 1.665 | 1.789 ± 0.846 | 9.392 ± 2.82 | 3.578 ± 0.899 | 5.367 ± 1.999 | 5.367 ± 1.855 | 3.578 ± 2.029 | 4.472 ± 1.004 | 3.578 ± 1.008 | 0.894 ± 0.613 | 5.367 ± 1.651 | 0.0 ± 0.0
Met | 1.789 ± 0.958 | 0.0 ± 0.0 | 0.894 ± 0.638 | 1.789 ± 1.058 | 0.447 ± 0.372 | 0.447 ± 0.525 | 0.0 ± 0.0 | 1.342 ± 0.983 | 1.342 ± 0.886 | 4.025 ± 1.49 | 1.342 ± 0.729 | 0.894 ± 0.401 | 0.894 ± 0.721 | 1.342 ± 0.782 | 0.0 ± 0.0 | 1.342 ± 0.663 | 3.578 ± 0.795 | 0.447 ± 0.446 | 0.0 ± 0.0 | 0.894 ± 0.666 | 0.0 ± 0.0
Asn | 4.025 ± 0.85 | 0.894 ± 0.545 | 2.236 ± 1.353 | 2.236 ± 0.928 | 1.342 ± 0.853 | 4.919 ± 1.245 | 1.342 ± 0.689 | 4.919 ± 1.233 | 6.708 ± 1.706 | 3.578 ± 1.344 | 1.789 ± 0.728 | 2.236 ± 0.824 | 3.578 ± 1.576 | 2.683 ± 0.701 | 4.919 ± 0.986 | 4.025 ± 1.617 | 4.025 ± 1.444 | 1.789 ± 0.646 | 0.0 ± 0.0 | 1.342 ± 0.808 | 0.0 ± 0.0
Pro | 2.683 ± 1.05 | 0.0 ± 0.0 | 2.236 ± 1.149 | 2.683 ± 0.903 | 2.236 ± 0.728 | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.472 ± 1.267 | 3.131 ± 0.894 | 4.472 ± 1.791 | 0.447 ± 0.399 | 1.789 ± 1.487 | 1.789 ± 0.958 | 0.447 ± 0.372 | 1.789 ± 0.744 | 0.0 ± 0.0 | 1.789 ± 0.848 | 0.894 ± 0.557 | 0.0 ± 0.0 | 0.894 ± 0.744 | 0.0 ± 0.0
Gln | 3.578 ± 1.121 | 0.0 ± 0.0 | 2.236 ± 1.139 | 4.472 ± 1.66 | 1.789 ± 0.829 | 1.789 ± 0.739 | 0.447 ± 0.41 | 4.472 ± 1.532 | 3.578 ± 1.154 | 4.919 ± 0.872 | 0.0 ± 0.0 | 0.894 ± 0.579 | 1.789 ± 0.974 | 1.789 ± 0.751 | 0.0 ± 0.0 | 3.131 ± 1.771 | 3.131 ± 1.266 | 3.131 ± 1.219 | 0.447 ± 0.463 | 2.683 ± 0.851 | 0.0 ± 0.0
Arg | 2.683 ± 1.046 | 0.0 ± 0.0 | 3.131 ± 0.771 | 3.578 ± 1.122 | 2.236 ± 1.214 | 2.236 ± 0.999 | 1.789 ± 1.248 | 3.578 ± 0.964 | 3.578 ± 1.272 | 3.578 ± 0.835 | 0.894 ± 0.503 | 0.894 ± 0.746 | 0.894 ± 0.557 | 2.236 ± 0.966 | 2.683 ± 1.003 | 0.894 ± 0.613 | 3.578 ± 1.506 | 1.342 ± 0.689 | 0.447 ± 0.542 | 2.236 ± 1.342 | 0.0 ± 0.0
Ser | 2.683 ± 1.165 | 0.894 ± 0.613 | 4.472 ± 1.436 | 2.236 ± 0.844 | 2.683 ± 0.945 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.683 ± 1.183 | 3.131 ± 0.809 | 5.367 ± 1.691 | 1.342 ± 0.691 | 1.789 ± 0.726 | 1.342 ± 0.733 | 3.131 ± 1.304 | 1.342 ± 0.702 | 1.789 ± 0.907 | 2.236 ± 1.2 | 3.131 ± 0.838 | 0.0 ± 0.0 | 3.131 ± 1.814 | 0.0 ± 0.0
Thr | 4.025 ± 0.805 | 0.447 ± 0.399 | 2.683 ± 0.966 | 5.814 ± 1.305 | 2.683 ± 1.344 | 2.683 ± 1.31 | 0.447 ± 0.41 | 7.156 ± 1.945 | 5.814 ± 1.932 | 6.708 ± 2.19 | 0.0 ± 0.0 | 3.131 ± 1.782 | 3.131 ± 2.066 | 0.894 ± 0.773 | 2.683 ± 0.743 | 2.236 ± 0.721 | 3.131 ± 0.681 | 3.578 ± 1.495 | 0.0 ± 0.0 | 2.683 ± 0.945 | 0.0 ± 0.0
Val | 2.683 ± 0.971 | 0.0 ± 0.0 | 1.342 ± 0.862 | 4.919 ± 1.488 | 2.683 ± 1.087 | 1.342 ± 1.078 | 0.894 ± 0.82 | 4.025 ± 1.385 | 4.472 ± 1.246 | 6.261 ± 1.047 | 2.236 ± 1.225 | 4.025 ± 1.696 | 1.342 ± 0.59 | 1.789 ± 0.871 | 0.894 ± 0.557 | 1.342 ± 0.721 | 2.683 ± 0.89 | 2.236 ± 0.9 | 0.0 ± 0.0 | 2.683 ± 1.225 | 0.0 ± 0.0
Trp | 0.447 ± 0.41 | 0.0 ± 0.0 | 0.447 ± 0.41 | 0.447 ± 0.542 | 0.447 ± 0.505 | 0.447 ± 0.528 | 0.447 ± 0.446 | 0.447 ± 0.576 | 0.894 ± 0.744 | 0.894 ± 0.744 | 0.0 ± 0.0 | 0.447 ± 0.372 | 0.0 ± 0.0 | 0.894 ± 0.668 | 0.0 ± 0.0 | 0.447 ± 0.41 | 0.447 ± 0.463 | 0.894 ± 0.583 | 0.447 ± 0.41 | 0.447 ± 0.372 | 0.0 ± 0.0
Tyr | 3.578 ± 0.89 | 0.894 ± 0.503 | 2.683 ± 1.086 | 3.131 ± 1.467 | 1.342 ± 0.86 | 3.578 ± 1.234 | 2.236 ± 0.777 | 3.131 ± 0.795 | 7.156 ± 1.559 | 7.603 ± 2.607 | 1.342 ± 0.78 | 4.472 ± 1.05 | 0.447 ± 0.372 | 4.919 ± 1.839 | 1.789 ± 1.334 | 2.236 ± 0.812 | 1.789 ± 0.697 | 0.894 ± 0.82 | 0.447 ± 0.372 | 2.683 ± 0.94 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 14 proteins (2237 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
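A protein-level bootstrap of this kind could look roughly like the Python sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. The function names, the toy sequences, and the use of the sample standard deviation as the error measure are assumptions; the page only states that 100 bootstrap rounds at the protein level were used.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_fraction(proteins, dipeptide):
    """Fraction of all within-protein adjacent pairs equal to `dipeptide`."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread (in per mille) of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(1000.0 * dipeptide_fraction(sample, dipeptide))
    # Sample standard deviation as the error measure (an assumption).
    return stdev(estimates)


# Toy sequences invented for illustration only (not Javan229 proteins).
toy = ["MKELLA", "MKKLL", "AEELK"]
print(bootstrap_error(toy, "LL"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins in a small proteome such as this one.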

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski