Amino acid dipepetide frequency for Streptococcus phage Javan515

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.814AlaAla: 2.814 ± 0.866
0.264AlaCys: 0.264 ± 0.137
4.573AlaAsp: 4.573 ± 0.563
6.948AlaGlu: 6.948 ± 0.755
2.814AlaPhe: 2.814 ± 0.386
3.606AlaGly: 3.606 ± 0.737
0.44AlaHis: 0.44 ± 0.228
6.157AlaIle: 6.157 ± 0.614
6.596AlaLys: 6.596 ± 0.727
4.661AlaLeu: 4.661 ± 0.539
1.759AlaMet: 1.759 ± 0.35
4.398AlaAsn: 4.398 ± 0.66
1.583AlaPro: 1.583 ± 0.295
1.847AlaGln: 1.847 ± 0.337
2.023AlaArg: 2.023 ± 0.458
4.222AlaSer: 4.222 ± 0.894
3.694AlaThr: 3.694 ± 0.689
3.518AlaVal: 3.518 ± 0.744
1.231AlaTrp: 1.231 ± 0.271
2.814AlaTyr: 2.814 ± 0.468
0.0AlaXaa: 0.0 ± 0.0
Cys
0.176CysAla: 0.176 ± 0.117
0.0CysCys: 0.0 ± 0.0
0.264CysAsp: 0.264 ± 0.153
0.528CysGlu: 0.528 ± 0.185
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.088CysHis: 0.088 ± 0.109
0.352CysIle: 0.352 ± 0.217
0.352CysLys: 0.352 ± 0.169
0.176CysLeu: 0.176 ± 0.129
0.088CysMet: 0.088 ± 0.1
0.352CysAsn: 0.352 ± 0.164
0.176CysPro: 0.176 ± 0.136
0.0CysGln: 0.0 ± 0.0
0.352CysArg: 0.352 ± 0.189
0.44CysSer: 0.44 ± 0.169
0.0CysThr: 0.0 ± 0.0
0.176CysVal: 0.176 ± 0.118
0.0CysTrp: 0.0 ± 0.0
0.616CysTyr: 0.616 ± 0.312
0.0CysXaa: 0.0 ± 0.0
Asp
4.222AspAla: 4.222 ± 0.623
0.528AspCys: 0.528 ± 0.226
4.134AspAsp: 4.134 ± 0.629
4.485AspGlu: 4.485 ± 0.664
2.814AspPhe: 2.814 ± 0.606
5.013AspGly: 5.013 ± 0.586
0.528AspHis: 0.528 ± 0.207
4.398AspIle: 4.398 ± 0.499
6.157AspLys: 6.157 ± 0.702
5.717AspLeu: 5.717 ± 0.704
1.231AspMet: 1.231 ± 0.391
5.101AspAsn: 5.101 ± 0.626
1.143AspPro: 1.143 ± 0.354
1.583AspGln: 1.583 ± 0.386
2.551AspArg: 2.551 ± 0.452
3.518AspSer: 3.518 ± 0.557
4.046AspThr: 4.046 ± 0.697
4.398AspVal: 4.398 ± 0.543
0.704AspTrp: 0.704 ± 0.252
3.87AspTyr: 3.87 ± 0.557
0.0AspXaa: 0.0 ± 0.0
Glu
4.837GluAla: 4.837 ± 0.818
0.088GluCys: 0.088 ± 0.081
2.639GluAsp: 2.639 ± 0.649
4.837GluGlu: 4.837 ± 0.793
2.726GluPhe: 2.726 ± 0.531
2.814GluGly: 2.814 ± 0.472
0.792GluHis: 0.792 ± 0.273
4.925GluIle: 4.925 ± 0.667
5.629GluLys: 5.629 ± 0.751
8.179GluLeu: 8.179 ± 0.916
2.023GluMet: 2.023 ± 0.324
3.518GluAsn: 3.518 ± 0.657
1.319GluPro: 1.319 ± 0.348
3.87GluGln: 3.87 ± 0.602
3.958GluArg: 3.958 ± 0.921
4.046GluSer: 4.046 ± 0.506
4.573GluThr: 4.573 ± 0.648
5.101GluVal: 5.101 ± 0.649
0.792GluTrp: 0.792 ± 0.251
3.518GluTyr: 3.518 ± 0.444
0.0GluXaa: 0.0 ± 0.0
Phe
2.814PheAla: 2.814 ± 0.472
0.088PheCys: 0.088 ± 0.086
3.694PheAsp: 3.694 ± 0.513
2.551PheGlu: 2.551 ± 0.419
1.495PhePhe: 1.495 ± 0.413
2.375PheGly: 2.375 ± 0.345
0.176PheHis: 0.176 ± 0.134
2.287PheIle: 2.287 ± 0.521
3.342PheLys: 3.342 ± 0.522
2.375PheLeu: 2.375 ± 0.573
0.704PheMet: 0.704 ± 0.235
2.375PheAsn: 2.375 ± 0.521
1.319PhePro: 1.319 ± 0.321
1.407PheGln: 1.407 ± 0.357
1.495PheArg: 1.495 ± 0.307
2.814PheSer: 2.814 ± 0.633
2.551PheThr: 2.551 ± 0.461
2.726PheVal: 2.726 ± 0.485
0.616PheTrp: 0.616 ± 0.223
1.671PheTyr: 1.671 ± 0.316
0.0PheXaa: 0.0 ± 0.0
Gly
3.694GlyAla: 3.694 ± 0.685
0.176GlyCys: 0.176 ± 0.217
4.134GlyAsp: 4.134 ± 0.657
3.078GlyGlu: 3.078 ± 0.524
1.759GlyPhe: 1.759 ± 0.357
4.134GlyGly: 4.134 ± 0.979
1.055GlyHis: 1.055 ± 0.282
5.189GlyIle: 5.189 ± 0.698
4.31GlyLys: 4.31 ± 0.678
4.31GlyLeu: 4.31 ± 0.589
2.375GlyMet: 2.375 ± 0.536
3.342GlyAsn: 3.342 ± 0.411
1.319GlyPro: 1.319 ± 0.366
2.375GlyGln: 2.375 ± 0.523
2.111GlyArg: 2.111 ± 0.492
3.606GlySer: 3.606 ± 0.871
4.573GlyThr: 4.573 ± 0.951
4.134GlyVal: 4.134 ± 0.745
1.407GlyTrp: 1.407 ± 0.294
3.958GlyTyr: 3.958 ± 0.6
0.0GlyXaa: 0.0 ± 0.0
His
1.319HisAla: 1.319 ± 0.425
0.176HisCys: 0.176 ± 0.103
0.616HisAsp: 0.616 ± 0.232
0.792HisGlu: 0.792 ± 0.273
0.528HisPhe: 0.528 ± 0.231
0.792HisGly: 0.792 ± 0.281
0.0HisHis: 0.0 ± 0.0
1.055HisIle: 1.055 ± 0.303
0.704HisLys: 0.704 ± 0.255
0.967HisLeu: 0.967 ± 0.258
0.352HisMet: 0.352 ± 0.167
1.055HisAsn: 1.055 ± 0.343
0.44HisPro: 0.44 ± 0.177
0.176HisGln: 0.176 ± 0.118
0.792HisArg: 0.792 ± 0.268
0.616HisSer: 0.616 ± 0.359
0.88HisThr: 0.88 ± 0.268
0.88HisVal: 0.88 ± 0.306
0.176HisTrp: 0.176 ± 0.115
0.44HisTyr: 0.44 ± 0.212
0.0HisXaa: 0.0 ± 0.0
Ile
5.013IleAla: 5.013 ± 0.722
0.44IleCys: 0.44 ± 0.2
5.981IleAsp: 5.981 ± 0.627
5.541IleGlu: 5.541 ± 0.934
1.847IlePhe: 1.847 ± 0.391
5.101IleGly: 5.101 ± 0.704
1.055IleHis: 1.055 ± 0.235
4.573IleIle: 4.573 ± 0.665
4.925IleLys: 4.925 ± 0.592
4.573IleLeu: 4.573 ± 0.672
1.231IleMet: 1.231 ± 0.442
4.749IleAsn: 4.749 ± 0.693
2.639IlePro: 2.639 ± 0.637
2.463IleGln: 2.463 ± 0.443
2.463IleArg: 2.463 ± 0.452
5.277IleSer: 5.277 ± 0.498
5.893IleThr: 5.893 ± 0.732
4.925IleVal: 4.925 ± 0.671
0.704IleTrp: 0.704 ± 0.255
2.639IleTyr: 2.639 ± 0.553
0.0IleXaa: 0.0 ± 0.0
Lys
5.805LysAla: 5.805 ± 0.788
0.088LysCys: 0.088 ± 0.095
5.541LysAsp: 5.541 ± 0.812
5.013LysGlu: 5.013 ± 0.771
2.287LysPhe: 2.287 ± 0.367
4.398LysGly: 4.398 ± 0.7
1.319LysHis: 1.319 ± 0.354
6.772LysIle: 6.772 ± 0.861
6.069LysLys: 6.069 ± 0.926
6.157LysLeu: 6.157 ± 0.665
1.935LysMet: 1.935 ± 0.498
5.013LysAsn: 5.013 ± 0.604
2.551LysPro: 2.551 ± 0.588
4.485LysGln: 4.485 ± 0.639
4.485LysArg: 4.485 ± 0.732
6.332LysSer: 6.332 ± 0.605
5.101LysThr: 5.101 ± 0.813
5.365LysVal: 5.365 ± 0.722
1.231LysTrp: 1.231 ± 0.377
4.134LysTyr: 4.134 ± 0.693
0.0LysXaa: 0.0 ± 0.0
Leu
5.717LeuAla: 5.717 ± 0.856
0.44LeuCys: 0.44 ± 0.206
5.805LeuAsp: 5.805 ± 0.73
7.476LeuGlu: 7.476 ± 0.799
2.463LeuPhe: 2.463 ± 0.406
3.87LeuGly: 3.87 ± 0.763
0.792LeuHis: 0.792 ± 0.26
5.981LeuIle: 5.981 ± 0.822
7.564LeuLys: 7.564 ± 0.849
6.069LeuLeu: 6.069 ± 0.849
1.495LeuMet: 1.495 ± 0.335
4.485LeuAsn: 4.485 ± 0.662
2.023LeuPro: 2.023 ± 0.606
3.342LeuGln: 3.342 ± 0.594
2.902LeuArg: 2.902 ± 0.514
6.332LeuSer: 6.332 ± 0.813
6.157LeuThr: 6.157 ± 0.58
4.573LeuVal: 4.573 ± 0.692
0.704LeuTrp: 0.704 ± 0.202
3.078LeuTyr: 3.078 ± 0.448
0.0LeuXaa: 0.0 ± 0.0
Met
1.495MetAla: 1.495 ± 0.344
0.0MetCys: 0.0 ± 0.0
1.407MetAsp: 1.407 ± 0.382
1.583MetGlu: 1.583 ± 0.439
1.143MetPhe: 1.143 ± 0.249
0.528MetGly: 0.528 ± 0.175
0.264MetHis: 0.264 ± 0.151
1.495MetIle: 1.495 ± 0.366
1.055MetLys: 1.055 ± 0.28
2.551MetLeu: 2.551 ± 0.523
0.44MetMet: 0.44 ± 0.195
1.143MetAsn: 1.143 ± 0.29
1.143MetPro: 1.143 ± 0.313
0.88MetGln: 0.88 ± 0.339
1.055MetArg: 1.055 ± 0.276
1.495MetSer: 1.495 ± 0.27
2.375MetThr: 2.375 ± 0.377
1.671MetVal: 1.671 ± 0.427
0.176MetTrp: 0.176 ± 0.127
0.704MetTyr: 0.704 ± 0.274
0.0MetXaa: 0.0 ± 0.0
Asn
4.222AsnAla: 4.222 ± 0.529
0.176AsnCys: 0.176 ± 0.116
3.518AsnAsp: 3.518 ± 0.434
3.606AsnGlu: 3.606 ± 0.523
2.199AsnPhe: 2.199 ± 0.408
5.101AsnGly: 5.101 ± 0.571
0.967AsnHis: 0.967 ± 0.254
4.134AsnIle: 4.134 ± 0.629
5.629AsnLys: 5.629 ± 0.505
5.013AsnLeu: 5.013 ± 0.665
1.231AsnMet: 1.231 ± 0.369
4.134AsnAsn: 4.134 ± 0.567
2.199AsnPro: 2.199 ± 0.581
2.375AsnGln: 2.375 ± 0.389
1.319AsnArg: 1.319 ± 0.269
3.166AsnSer: 3.166 ± 0.638
2.99AsnThr: 2.99 ± 0.514
3.87AsnVal: 3.87 ± 0.519
0.88AsnTrp: 0.88 ± 0.267
2.199AsnTyr: 2.199 ± 0.31
0.0AsnXaa: 0.0 ± 0.0
Pro
1.319ProAla: 1.319 ± 0.345
0.0ProCys: 0.0 ± 0.0
1.935ProAsp: 1.935 ± 0.626
2.111ProGlu: 2.111 ± 0.513
1.847ProPhe: 1.847 ± 0.478
0.88ProGly: 0.88 ± 0.224
0.616ProHis: 0.616 ± 0.201
1.495ProIle: 1.495 ± 0.383
2.99ProLys: 2.99 ± 0.742
2.551ProLeu: 2.551 ± 0.495
0.528ProMet: 0.528 ± 0.209
1.759ProAsn: 1.759 ± 0.387
1.055ProPro: 1.055 ± 0.236
1.055ProGln: 1.055 ± 0.306
1.143ProArg: 1.143 ± 0.369
3.166ProSer: 3.166 ± 0.525
1.583ProThr: 1.583 ± 0.541
1.583ProVal: 1.583 ± 0.31
0.088ProTrp: 0.088 ± 0.09
1.495ProTyr: 1.495 ± 0.354
0.0ProXaa: 0.0 ± 0.0
Gln
3.078GlnAla: 3.078 ± 0.477
0.264GlnCys: 0.264 ± 0.129
2.199GlnAsp: 2.199 ± 0.416
2.199GlnGlu: 2.199 ± 0.45
1.319GlnPhe: 1.319 ± 0.34
2.111GlnGly: 2.111 ± 0.576
0.704GlnHis: 0.704 ± 0.222
2.375GlnIle: 2.375 ± 0.372
4.134GlnLys: 4.134 ± 0.666
3.254GlnLeu: 3.254 ± 0.601
0.88GlnMet: 0.88 ± 0.269
2.902GlnAsn: 2.902 ± 0.523
1.231GlnPro: 1.231 ± 0.339
1.495GlnGln: 1.495 ± 0.407
1.407GlnArg: 1.407 ± 0.337
3.254GlnSer: 3.254 ± 0.42
1.671GlnThr: 1.671 ± 0.403
3.254GlnVal: 3.254 ± 0.446
0.44GlnTrp: 0.44 ± 0.178
1.495GlnTyr: 1.495 ± 0.382
0.0GlnXaa: 0.0 ± 0.0
Arg
2.551ArgAla: 2.551 ± 0.393
0.088ArgCys: 0.088 ± 0.075
2.199ArgAsp: 2.199 ± 0.347
3.254ArgGlu: 3.254 ± 0.564
1.319ArgPhe: 1.319 ± 0.378
2.375ArgGly: 2.375 ± 0.489
0.44ArgHis: 0.44 ± 0.232
3.166ArgIle: 3.166 ± 0.574
4.134ArgLys: 4.134 ± 0.649
3.694ArgLeu: 3.694 ± 0.676
0.792ArgMet: 0.792 ± 0.245
1.847ArgAsn: 1.847 ± 0.391
1.495ArgPro: 1.495 ± 0.34
2.111ArgGln: 2.111 ± 0.469
2.287ArgArg: 2.287 ± 0.487
1.495ArgSer: 1.495 ± 0.422
2.463ArgThr: 2.463 ± 0.478
1.935ArgVal: 1.935 ± 0.444
0.704ArgTrp: 0.704 ± 0.367
1.759ArgTyr: 1.759 ± 0.505
0.0ArgXaa: 0.0 ± 0.0
Ser
5.101SerAla: 5.101 ± 0.832
0.088SerCys: 0.088 ± 0.094
5.013SerAsp: 5.013 ± 0.704
4.573SerGlu: 4.573 ± 0.658
3.43SerPhe: 3.43 ± 0.721
5.805SerGly: 5.805 ± 1.095
0.88SerHis: 0.88 ± 0.317
4.398SerIle: 4.398 ± 0.586
5.365SerLys: 5.365 ± 0.735
5.277SerLeu: 5.277 ± 0.572
1.671SerMet: 1.671 ± 0.501
2.639SerAsn: 2.639 ± 0.55
1.495SerPro: 1.495 ± 0.335
3.87SerGln: 3.87 ± 0.648
1.847SerArg: 1.847 ± 0.369
5.013SerSer: 5.013 ± 1.326
3.694SerThr: 3.694 ± 0.756
4.222SerVal: 4.222 ± 0.717
0.528SerTrp: 0.528 ± 0.233
2.111SerTyr: 2.111 ± 0.427
0.0SerXaa: 0.0 ± 0.0
Thr
3.518ThrAla: 3.518 ± 0.559
0.0ThrCys: 0.0 ± 0.0
4.134ThrAsp: 4.134 ± 0.607
3.87ThrGlu: 3.87 ± 0.6
3.43ThrPhe: 3.43 ± 0.604
4.485ThrGly: 4.485 ± 0.766
1.055ThrHis: 1.055 ± 0.287
5.981ThrIle: 5.981 ± 1.132
5.453ThrLys: 5.453 ± 0.67
6.069ThrLeu: 6.069 ± 0.851
0.792ThrMet: 0.792 ± 0.252
3.782ThrAsn: 3.782 ± 0.675
2.023ThrPro: 2.023 ± 0.393
2.023ThrGln: 2.023 ± 0.404
2.463ThrArg: 2.463 ± 0.364
3.518ThrSer: 3.518 ± 0.494
4.837ThrThr: 4.837 ± 0.881
4.398ThrVal: 4.398 ± 0.923
0.704ThrTrp: 0.704 ± 0.196
2.463ThrTyr: 2.463 ± 0.517
0.0ThrXaa: 0.0 ± 0.0
Val
4.485ValAla: 4.485 ± 0.628
0.616ValCys: 0.616 ± 0.267
4.749ValAsp: 4.749 ± 0.593
4.837ValGlu: 4.837 ± 0.839
2.551ValPhe: 2.551 ± 0.435
3.782ValGly: 3.782 ± 0.779
0.704ValHis: 0.704 ± 0.266
3.166ValIle: 3.166 ± 0.566
5.541ValLys: 5.541 ± 0.812
3.958ValLeu: 3.958 ± 0.549
1.759ValMet: 1.759 ± 0.421
3.254ValAsn: 3.254 ± 0.482
2.199ValPro: 2.199 ± 0.376
2.111ValGln: 2.111 ± 0.529
2.551ValArg: 2.551 ± 0.515
5.453ValSer: 5.453 ± 0.602
5.365ValThr: 5.365 ± 0.801
4.573ValVal: 4.573 ± 0.761
0.528ValTrp: 0.528 ± 0.219
2.375ValTyr: 2.375 ± 0.42
0.0ValXaa: 0.0 ± 0.0
Trp
0.616TrpAla: 0.616 ± 0.238
0.264TrpCys: 0.264 ± 0.134
0.704TrpAsp: 0.704 ± 0.279
0.528TrpGlu: 0.528 ± 0.191
0.44TrpPhe: 0.44 ± 0.203
0.88TrpGly: 0.88 ± 0.357
0.176TrpHis: 0.176 ± 0.128
0.704TrpIle: 0.704 ± 0.193
0.88TrpLys: 0.88 ± 0.29
1.143TrpLeu: 1.143 ± 0.37
0.352TrpMet: 0.352 ± 0.201
0.616TrpAsn: 0.616 ± 0.206
0.088TrpPro: 0.088 ± 0.105
0.528TrpGln: 0.528 ± 0.183
0.967TrpArg: 0.967 ± 0.295
1.231TrpSer: 1.231 ± 0.385
0.88TrpThr: 0.88 ± 0.264
0.88TrpVal: 0.88 ± 0.253
0.176TrpTrp: 0.176 ± 0.176
0.352TrpTyr: 0.352 ± 0.189
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.166TyrAla: 3.166 ± 0.665
0.352TyrCys: 0.352 ± 0.19
2.902TyrAsp: 2.902 ± 0.423
2.375TyrGlu: 2.375 ± 0.433
2.463TyrPhe: 2.463 ± 0.552
2.99TyrGly: 2.99 ± 0.44
0.704TyrHis: 0.704 ± 0.231
3.43TyrIle: 3.43 ± 0.573
3.342TyrLys: 3.342 ± 0.612
4.573TyrLeu: 4.573 ± 0.654
0.88TyrMet: 0.88 ± 0.221
2.463TyrAsn: 2.463 ± 0.455
1.759TyrPro: 1.759 ± 0.392
1.671TyrGln: 1.671 ± 0.363
1.935TyrArg: 1.935 ± 0.368
2.023TyrSer: 2.023 ± 0.386
1.671TyrThr: 1.671 ± 0.398
2.375TyrVal: 2.375 ± 0.488
0.616TyrTrp: 0.616 ± 0.213
2.375TyrTyr: 2.375 ± 0.415
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (11371 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski