Amino acid dipeptide frequency for Clostridium phage phiCTC2A

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
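As a rough illustration of what such a calculation might look like, the Python sketch below counts overlapping adjacent residue pairs and reports them in per mille. The function name dipeptide_frequencies, the one-letter input alphabet, the mapping of any non-standard letter to X, and the choice not to count pairs across protein boundaries are all assumptions made for this example; it is not the code used to generate the table.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs; in this sketch pairs never span two proteins.
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not real phage proteins.
freqs = dipeptide_frequencies(["MKKLE", "AKK"])
```

Because each frequency is a share of all counted pairs, the values returned for one proteome always sum to 1000 ‰.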

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and underrepresented ones in blue in the table.
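The arithmetic behind this expectation can be sketched in a few lines of Python. The helper names uniform_expectation_per_mille and classify are hypothetical, and the plain greater-than or less-than comparison (ignoring the error estimate) is an assumption; the page does not state the exact rule used for the colouring.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency if every dipeptide were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(value_per_mille, expected_per_mille):
    """Label a value relative to the expectation (simple threshold, assumed)."""
    if value_per_mille > expected_per_mille:
        return "over"
    if value_per_mille < expected_per_mille:
        return "under"
    return "as expected"

expected = uniform_expectation_per_mille()            # 20 standard amino acids
expected_with_xaa = uniform_expectation_per_mille(21)  # Xaa counted as a 21st

# Two values taken from the table: GluLys and CysGln.
labels = (classify(10.431, expected), classify(0.072, expected))
```

With 20 amino acids the expectation is 2.5 ‰, so in this sketch GluLys (10.431 ‰) counts as overrepresented and CysGln (0.072 ‰) as underrepresented.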

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
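For readers who prefer to see the unit conversion spelled out, here is a minimal Python sketch. The function names are hypothetical, and the example value is the GluLys entry from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3

def per_mille_to_percent(value_per_mille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_per_mille / 10.0

fraction = per_mille_to_fraction(10.431)  # GluLys as a fraction
percent = per_mille_to_percent(10.431)    # GluLys in percent
```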

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.87 ± 0.368
AlaCys: 0.36 ± 0.177
AlaAsp: 2.518 ± 0.382
AlaGlu: 4.244 ± 0.629
AlaPhe: 1.655 ± 0.305
AlaGly: 2.59 ± 0.619
AlaHis: 0.575 ± 0.217
AlaIle: 4.1 ± 0.633
AlaLys: 4.748 ± 0.66
AlaLeu: 4.46 ± 0.663
AlaMet: 1.367 ± 0.313
AlaAsn: 3.165 ± 0.484
AlaPro: 0.791 ± 0.21
AlaGln: 1.367 ± 0.245
AlaArg: 0.935 ± 0.298
AlaSer: 2.518 ± 0.459
AlaThr: 3.021 ± 0.467
AlaVal: 1.798 ± 0.35
AlaTrp: 0.504 ± 0.156
AlaTyr: 1.655 ± 0.349
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.144 ± 0.093
CysCys: 0.216 ± 0.111
CysAsp: 0.575 ± 0.19
CysGlu: 1.295 ± 0.4
CysPhe: 0.432 ± 0.174
CysGly: 0.432 ± 0.148
CysHis: 0.144 ± 0.088
CysIle: 1.367 ± 0.354
CysLys: 1.511 ± 0.287
CysLeu: 0.432 ± 0.145
CysMet: 0.504 ± 0.16
CysAsn: 0.719 ± 0.229
CysPro: 0.144 ± 0.116
CysGln: 0.072 ± 0.074
CysArg: 0.288 ± 0.132
CysSer: 1.007 ± 0.296
CysThr: 0.575 ± 0.194
CysVal: 0.432 ± 0.149
CysTrp: 0.144 ± 0.109
CysTyr: 0.647 ± 0.226
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.439 ± 0.377
AspCys: 0.575 ± 0.194
AspAsp: 2.518 ± 0.407
AspGlu: 4.316 ± 0.561
AspPhe: 3.309 ± 0.522
AspGly: 5.179 ± 0.702
AspHis: 0.504 ± 0.193
AspIle: 7.481 ± 0.853
AspLys: 7.481 ± 1.029
AspLeu: 6.043 ± 0.606
AspMet: 1.439 ± 0.332
AspAsn: 4.028 ± 0.521
AspPro: 0.935 ± 0.2
AspGln: 1.439 ± 0.354
AspArg: 2.014 ± 0.388
AspSer: 2.59 ± 0.435
AspThr: 3.453 ± 0.472
AspVal: 3.669 ± 0.53
AspTrp: 1.007 ± 0.276
AspTyr: 3.813 ± 0.643
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.093 ± 0.426
GluCys: 1.223 ± 0.332
GluAsp: 4.82 ± 0.57
GluGlu: 7.481 ± 0.739
GluPhe: 3.165 ± 0.396
GluGly: 3.813 ± 0.433
GluHis: 1.798 ± 0.341
GluIle: 7.769 ± 0.911
GluLys: 10.431 ± 1.037
GluLeu: 9.136 ± 0.871
GluMet: 2.23 ± 0.419
GluAsn: 5.467 ± 0.624
GluPro: 1.655 ± 0.331
GluGln: 2.302 ± 0.403
GluArg: 3.237 ± 0.496
GluSer: 3.453 ± 0.417
GluThr: 3.885 ± 0.493
GluVal: 4.892 ± 0.675
GluTrp: 1.151 ± 0.246
GluTyr: 4.1 ± 0.602
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.007 ± 0.236
PheCys: 0.36 ± 0.169
PheAsp: 2.877 ± 0.489
PheGlu: 3.381 ± 0.516
PhePhe: 1.295 ± 0.319
PheGly: 1.798 ± 0.346
PheHis: 0.432 ± 0.154
PheIle: 3.597 ± 0.479
PheLys: 5.827 ± 0.558
PheLeu: 3.021 ± 0.4
PheMet: 0.791 ± 0.201
PheAsn: 3.957 ± 0.513
PhePro: 0.504 ± 0.167
PheGln: 0.719 ± 0.202
PheArg: 1.439 ± 0.254
PheSer: 1.583 ± 0.288
PheThr: 1.798 ± 0.414
PheVal: 1.223 ± 0.302
PheTrp: 0.288 ± 0.122
PheTyr: 2.014 ± 0.382
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.59 ± 0.553
GlyCys: 0.432 ± 0.157
GlyAsp: 2.302 ± 0.341
GlyGlu: 4.172 ± 0.494
GlyPhe: 1.87 ± 0.639
GlyGly: 2.949 ± 0.523
GlyHis: 0.504 ± 0.149
GlyIle: 5.467 ± 0.712
GlyLys: 6.906 ± 0.908
GlyLeu: 4.1 ± 0.562
GlyMet: 1.511 ± 0.296
GlyAsn: 4.532 ± 0.562
GlyPro: 0.575 ± 0.226
GlyGln: 1.942 ± 0.373
GlyArg: 1.511 ± 0.381
GlySer: 2.518 ± 0.464
GlyThr: 3.237 ± 0.628
GlyVal: 3.093 ± 0.441
GlyTrp: 1.079 ± 0.309
GlyTyr: 2.949 ± 0.553
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.647 ± 0.205
HisCys: 0.144 ± 0.085
HisAsp: 0.935 ± 0.3
HisGlu: 1.079 ± 0.298
HisPhe: 0.719 ± 0.208
HisGly: 0.504 ± 0.175
HisHis: 0.072 ± 0.071
HisIle: 0.719 ± 0.202
HisLys: 1.798 ± 0.373
HisLeu: 1.079 ± 0.285
HisMet: 0.36 ± 0.149
HisAsn: 0.647 ± 0.223
HisPro: 0.36 ± 0.169
HisGln: 0.216 ± 0.115
HisArg: 1.079 ± 0.258
HisSer: 0.575 ± 0.263
HisThr: 0.719 ± 0.2
HisVal: 0.575 ± 0.18
HisTrp: 0.647 ± 0.168
HisTyr: 0.432 ± 0.154
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.453 ± 0.447
IleCys: 1.079 ± 0.31
IleAsp: 6.546 ± 0.887
IleGlu: 9.064 ± 0.976
IlePhe: 2.23 ± 0.373
IleGly: 4.964 ± 0.605
IleHis: 1.151 ± 0.226
IleIle: 6.546 ± 0.908
IleLys: 11.726 ± 1.161
IleLeu: 7.338 ± 0.658
IleMet: 1.295 ± 0.428
IleAsn: 8.489 ± 0.907
IlePro: 2.59 ± 0.455
IleGln: 2.59 ± 0.388
IleArg: 3.669 ± 0.455
IleSer: 4.604 ± 0.608
IleThr: 4.46 ± 0.539
IleVal: 4.1 ± 0.555
IleTrp: 0.575 ± 0.174
IleTyr: 4.244 ± 0.574
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.964 ± 0.498
LysCys: 1.439 ± 0.411
LysAsp: 8.776 ± 1.111
LysGlu: 12.229 ± 1.015
LysPhe: 3.885 ± 0.503
LysGly: 5.755 ± 0.663
LysHis: 1.942 ± 0.281
LysIle: 8.992 ± 0.764
LysLys: 11.366 ± 1.11
LysLeu: 9.64 ± 0.845
LysMet: 3.885 ± 0.525
LysAsn: 7.05 ± 0.622
LysPro: 2.877 ± 0.514
LysGln: 4.604 ± 0.547
LysArg: 3.813 ± 0.514
LysSer: 6.69 ± 0.82
LysThr: 5.467 ± 0.531
LysVal: 5.971 ± 0.65
LysTrp: 1.295 ± 0.304
LysTyr: 6.115 ± 0.963
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.172 ± 0.641
LeuCys: 1.079 ± 0.267
LeuAsp: 5.683 ± 0.689
LeuGlu: 8.201 ± 0.704
LeuPhe: 3.381 ± 0.457
LeuGly: 5.036 ± 0.988
LeuHis: 1.079 ± 0.396
LeuIle: 6.043 ± 0.492
LeuLys: 11.726 ± 0.838
LeuLeu: 6.043 ± 0.787
LeuMet: 1.87 ± 0.314
LeuAsn: 6.259 ± 0.73
LeuPro: 1.726 ± 0.353
LeuGln: 2.806 ± 0.423
LeuArg: 2.518 ± 0.489
LeuSer: 4.46 ± 0.669
LeuThr: 3.741 ± 0.63
LeuVal: 4.316 ± 0.615
LeuTrp: 0.647 ± 0.211
LeuTyr: 3.165 ± 0.45
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.942 ± 0.55
MetCys: 0.144 ± 0.102
MetAsp: 1.583 ± 0.28
MetGlu: 2.518 ± 0.438
MetPhe: 0.791 ± 0.223
MetGly: 0.935 ± 0.318
MetHis: 0.216 ± 0.117
MetIle: 1.439 ± 0.271
MetLys: 2.662 ± 0.435
MetLeu: 2.014 ± 0.399
MetMet: 0.719 ± 0.29
MetAsn: 1.87 ± 0.348
MetPro: 0.935 ± 0.291
MetGln: 1.367 ± 0.312
MetArg: 0.791 ± 0.214
MetSer: 2.158 ± 0.376
MetThr: 0.863 ± 0.236
MetVal: 1.079 ± 0.261
MetTrp: 0.432 ± 0.202
MetTyr: 1.367 ± 0.305
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.949 ± 0.382
AsnCys: 0.647 ± 0.196
AsnAsp: 3.957 ± 0.528
AsnGlu: 4.46 ± 0.653
AsnPhe: 3.813 ± 0.515
AsnGly: 3.309 ± 0.532
AsnHis: 0.647 ± 0.268
AsnIle: 8.201 ± 0.88
AsnLys: 8.561 ± 1.046
AsnLeu: 6.402 ± 0.68
AsnMet: 1.726 ± 0.348
AsnAsn: 5.899 ± 0.802
AsnPro: 2.806 ± 0.567
AsnGln: 1.655 ± 0.37
AsnArg: 2.949 ± 0.411
AsnSer: 4.316 ± 0.618
AsnThr: 3.381 ± 0.47
AsnVal: 3.021 ± 0.455
AsnTrp: 0.863 ± 0.225
AsnTyr: 3.669 ± 0.476
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.223 ± 0.289
ProCys: 0.216 ± 0.12
ProAsp: 1.439 ± 0.361
ProGlu: 2.086 ± 0.355
ProPhe: 1.367 ± 0.363
ProGly: 0.863 ± 0.265
ProHis: 0.288 ± 0.123
ProIle: 2.302 ± 0.462
ProLys: 1.798 ± 0.336
ProLeu: 1.798 ± 0.447
ProMet: 0.647 ± 0.256
ProAsn: 1.367 ± 0.294
ProPro: 0.863 ± 0.325
ProGln: 0.719 ± 0.204
ProArg: 0.719 ± 0.267
ProSer: 2.014 ± 0.332
ProThr: 0.863 ± 0.239
ProVal: 2.086 ± 0.408
ProTrp: 0.144 ± 0.105
ProTyr: 1.295 ± 0.342
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.726 ± 0.328
GlnCys: 0.144 ± 0.089
GlnAsp: 1.439 ± 0.409
GlnGlu: 2.518 ± 0.386
GlnPhe: 1.223 ± 0.27
GlnGly: 1.87 ± 0.322
GlnHis: 0.575 ± 0.209
GlnIle: 3.021 ± 0.429
GlnLys: 2.662 ± 0.398
GlnLeu: 2.806 ± 0.404
GlnMet: 0.647 ± 0.213
GlnAsn: 2.302 ± 0.353
GlnPro: 0.935 ± 0.276
GlnGln: 1.151 ± 0.291
GlnArg: 1.223 ± 0.272
GlnSer: 1.726 ± 0.338
GlnThr: 1.151 ± 0.273
GlnVal: 1.151 ± 0.257
GlnTrp: 0.36 ± 0.181
GlnTyr: 1.439 ± 0.244
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.655 ± 0.339
ArgCys: 0.432 ± 0.162
ArgAsp: 2.662 ± 0.424
ArgGlu: 2.949 ± 0.463
ArgPhe: 1.151 ± 0.316
ArgGly: 1.583 ± 0.313
ArgHis: 0.504 ± 0.165
ArgIle: 3.309 ± 0.56
ArgLys: 3.093 ± 0.51
ArgLeu: 2.518 ± 0.537
ArgMet: 1.223 ± 0.342
ArgAsn: 1.726 ± 0.386
ArgPro: 0.863 ± 0.257
ArgGln: 1.367 ± 0.33
ArgArg: 1.655 ± 0.457
ArgSer: 1.511 ± 0.405
ArgThr: 2.014 ± 0.345
ArgVal: 2.23 ± 0.396
ArgTrp: 0.647 ± 0.245
ArgTyr: 2.014 ± 0.343
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.446 ± 0.718
SerCys: 0.791 ± 0.256
SerAsp: 3.309 ± 0.506
SerGlu: 4.028 ± 0.734
SerPhe: 2.014 ± 0.417
SerGly: 3.093 ± 0.57
SerHis: 0.791 ± 0.243
SerIle: 5.036 ± 0.669
SerLys: 6.187 ± 0.58
SerLeu: 4.244 ± 0.536
SerMet: 1.295 ± 0.295
SerAsn: 4.892 ± 0.77
SerPro: 1.007 ± 0.221
SerGln: 1.151 ± 0.27
SerArg: 2.446 ± 0.357
SerSer: 2.374 ± 0.44
SerThr: 2.158 ± 0.372
SerVal: 1.942 ± 0.341
SerTrp: 0.575 ± 0.207
SerTyr: 2.158 ± 0.462
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.446 ± 0.51
ThrCys: 0.288 ± 0.181
ThrAsp: 3.741 ± 0.542
ThrGlu: 3.957 ± 0.555
ThrPhe: 2.014 ± 0.324
ThrGly: 3.957 ± 0.479
ThrHis: 0.575 ± 0.175
ThrIle: 5.108 ± 0.673
ThrLys: 4.892 ± 0.63
ThrLeu: 4.388 ± 0.525
ThrMet: 1.223 ± 0.327
ThrAsn: 3.165 ± 0.416
ThrPro: 1.655 ± 0.358
ThrGln: 1.439 ± 0.395
ThrArg: 0.719 ± 0.232
ThrSer: 2.806 ± 0.407
ThrThr: 2.806 ± 0.526
ThrVal: 2.518 ± 0.492
ThrTrp: 0.432 ± 0.171
ThrTyr: 1.511 ± 0.308
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.1 ± 0.591
ValCys: 0.36 ± 0.164
ValAsp: 4.028 ± 0.464
ValGlu: 3.309 ± 0.432
ValPhe: 1.367 ± 0.392
ValGly: 1.726 ± 0.307
ValHis: 0.432 ± 0.214
ValIle: 4.676 ± 0.459
ValLys: 5.467 ± 0.698
ValLeu: 4.1 ± 0.507
ValMet: 1.655 ± 0.373
ValAsn: 3.309 ± 0.414
ValPro: 1.439 ± 0.282
ValGln: 1.655 ± 0.402
ValArg: 1.87 ± 0.405
ValSer: 2.014 ± 0.432
ValThr: 3.021 ± 0.596
ValVal: 2.877 ± 0.498
ValTrp: 0.647 ± 0.218
ValTyr: 1.942 ± 0.385
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.288 ± 0.138
TrpCys: 0.288 ± 0.121
TrpAsp: 0.719 ± 0.24
TrpGlu: 0.935 ± 0.273
TrpPhe: 0.504 ± 0.196
TrpGly: 1.007 ± 0.27
TrpHis: 0.288 ± 0.143
TrpIle: 1.151 ± 0.277
TrpLys: 1.295 ± 0.283
TrpLeu: 1.007 ± 0.293
TrpMet: 0.288 ± 0.12
TrpAsn: 0.863 ± 0.215
TrpPro: 0.0 ± 0.0
TrpGln: 0.288 ± 0.159
TrpArg: 0.432 ± 0.185
TrpSer: 0.575 ± 0.199
TrpThr: 0.719 ± 0.203
TrpVal: 0.791 ± 0.174
TrpTrp: 0.288 ± 0.168
TrpTyr: 0.647 ± 0.201
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.086 ± 0.388
TyrCys: 0.863 ± 0.308
TyrAsp: 2.806 ± 0.478
TyrGlu: 2.949 ± 0.357
TyrPhe: 1.726 ± 0.369
TyrGly: 2.877 ± 0.37
TyrHis: 0.791 ± 0.218
TyrIle: 4.388 ± 0.563
TyrLys: 6.618 ± 0.687
TyrLeu: 3.381 ± 0.381
TyrMet: 1.079 ± 0.257
TyrAsn: 3.525 ± 0.556
TyrPro: 1.439 ± 0.381
TyrGln: 1.223 ± 0.314
TyrArg: 1.798 ± 0.378
TyrSer: 2.446 ± 0.479
TyrThr: 2.302 ± 0.453
TyrVal: 2.23 ± 0.349
TyrTrp: 0.647 ± 0.208
TyrTyr: 2.014 ± 0.401
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 67 proteins (13902 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
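One way a protein-level bootstrap could be set up is sketched below in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the fixed seed, the toy sequences, and the use of the standard deviation as the error measure are assumptions for illustration; the page does not specify these details.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Std. dev. of a dipeptide's per-mille frequency over protein resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the spread of the replicate estimates.
    return statistics.pstdev(estimates)

# Toy sequences in one-letter code, not real phage proteins.
toy_proteome = ["MKKLEKK", "AEKLLKE", "MGSKKAE"]
error = bootstrap_error(toy_proteome, "KK")
```

Resampling proteins rather than individual residue pairs keeps pairs from the same protein together, which is why the estimate is described as being at the protein level.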

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski