Amino acid dipeptide frequency for Mycobacterium phage Ochi17

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
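As a rough illustration of the calculation described above, the following Python sketch counts overlapping adjacent residue pairs in a set of protein sequences and compares a frequency with the uniform expectation of 0.25%. The function names, the toy sequences, the mapping of every non-standard letter to X (Xaa), and the choice to count pairs only within a protein are assumptions made for this example; they are not taken from the database's own pipeline.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard one-letter codes
UNIFORM_PER_MILLE = 1000.0 / 400        # 0.25% expressed in per mille (2.5)


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions of this sketch: non-standard residues become 'X' (Xaa),
    pairs overlap, and pairs never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


# Hypothetical toy input: two short sequences, one with a non-standard 'B'.
toy = ["MAAGL", "AAB"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

With one-letter codes, the pair "AA" corresponds to AlaAla in the table. Comparing against a single uniform threshold follows the description in the text; a comparison against the product of the two single amino acid frequencies would be a different, stricter notion of "expected".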

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
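Because the expected frequency is quoted in percent while the table uses per mille, a small conversion sketch may help. The helper names below are hypothetical, and the AlaAla value is simply the first entry of the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


def percent_to_per_mille(value):
    """Convert percent to per mille (multiply by 10)."""
    return value * 10.0


ala_ala = 15.467  # AlaAla frequency from the table, in per mille
print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
print(per_mille_to_percent(ala_ala))   # the same value in percent
print(percent_to_per_mille(0.25))      # uniform expectation in table units
```

In table units, the uniform expectation of 0.25% is 2.5‰, so this is the value to compare table entries against.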

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 15.467 ± 2.204
AlaCys: 1.017 ± 0.249
AlaAsp: 6.85 ± 0.577
AlaGlu: 7.867 ± 0.83
AlaPhe: 3.318 ± 0.506
AlaGly: 9.152 ± 1.34
AlaHis: 2.141 ± 0.343
AlaIle: 4.174 ± 0.525
AlaLys: 4.174 ± 0.506
AlaLeu: 8.456 ± 0.86
AlaMet: 2.729 ± 0.457
AlaAsn: 2.462 ± 0.341
AlaPro: 5.459 ± 0.515
AlaGln: 3.96 ± 0.475
AlaArg: 6.85 ± 0.667
AlaSer: 5.459 ± 0.581
AlaThr: 6.155 ± 0.542
AlaVal: 6.797 ± 0.534
AlaTrp: 2.622 ± 0.502
AlaTyr: 2.622 ± 0.403
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.017 ± 0.234
CysCys: 0.054 ± 0.049
CysAsp: 1.07 ± 0.283
CysGlu: 0.749 ± 0.228
CysPhe: 0.268 ± 0.112
CysGly: 1.552 ± 0.333
CysHis: 0.214 ± 0.123
CysIle: 0.161 ± 0.097
CysLys: 0.321 ± 0.124
CysLeu: 0.749 ± 0.255
CysMet: 0.0 ± 0.0
CysAsn: 0.321 ± 0.121
CysPro: 1.07 ± 0.306
CysGln: 0.428 ± 0.154
CysArg: 0.91 ± 0.224
CysSer: 0.749 ± 0.245
CysThr: 0.642 ± 0.208
CysVal: 0.642 ± 0.174
CysTrp: 0.214 ± 0.094
CysTyr: 0.214 ± 0.104
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.743 ± 0.585
AspCys: 0.856 ± 0.213
AspAsp: 4.014 ± 0.558
AspGlu: 3.693 ± 0.474
AspPhe: 1.606 ± 0.277
AspGly: 6.369 ± 0.533
AspHis: 1.445 ± 0.297
AspIle: 2.301 ± 0.349
AspLys: 1.927 ± 0.291
AspLeu: 5.673 ± 0.499
AspMet: 1.07 ± 0.264
AspAsn: 1.766 ± 0.351
AspPro: 4.174 ± 0.574
AspGln: 2.248 ± 0.347
AspArg: 5.727 ± 0.592
AspSer: 3.265 ± 0.45
AspThr: 4.282 ± 0.53
AspVal: 4.228 ± 0.516
AspTrp: 1.606 ± 0.32
AspTyr: 1.873 ± 0.338
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.797 ± 0.694
GluCys: 0.91 ± 0.241
GluAsp: 3.265 ± 0.353
GluGlu: 3.318 ± 0.572
GluPhe: 2.087 ± 0.372
GluGly: 3.265 ± 0.407
GluHis: 1.82 ± 0.407
GluIle: 2.569 ± 0.385
GluLys: 1.713 ± 0.283
GluLeu: 5.619 ± 0.731
GluMet: 1.445 ± 0.239
GluAsn: 2.034 ± 0.292
GluPro: 3.104 ± 0.471
GluGln: 2.944 ± 0.319
GluArg: 4.71 ± 0.612
GluSer: 3.746 ± 0.502
GluThr: 3.8 ± 0.608
GluVal: 3.8 ± 0.559
GluTrp: 1.445 ± 0.261
GluTyr: 2.034 ± 0.361
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.944 ± 0.413
PheCys: 0.161 ± 0.09
PheAsp: 2.569 ± 0.482
PheGlu: 1.98 ± 0.32
PhePhe: 0.856 ± 0.249
PheGly: 2.89 ± 0.636
PheHis: 0.535 ± 0.146
PheIle: 1.445 ± 0.348
PheLys: 0.91 ± 0.244
PheLeu: 1.552 ± 0.246
PheMet: 0.696 ± 0.165
PheAsn: 1.284 ± 0.306
PhePro: 1.927 ± 0.337
PheGln: 1.231 ± 0.335
PheArg: 1.713 ± 0.306
PheSer: 1.659 ± 0.308
PheThr: 2.034 ± 0.357
PheVal: 2.408 ± 0.307
PheTrp: 0.535 ± 0.167
PheTyr: 0.963 ± 0.259
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.242 ± 1.085
GlyCys: 1.231 ± 0.281
GlyAsp: 5.352 ± 0.475
GlyGlu: 4.496 ± 0.564
GlyPhe: 3.265 ± 0.55
GlyGly: 9.473 ± 1.49
GlyHis: 1.499 ± 0.273
GlyIle: 3.586 ± 0.562
GlyLys: 2.622 ± 0.346
GlyLeu: 6.422 ± 0.651
GlyMet: 2.515 ± 0.478
GlyAsn: 2.944 ± 0.383
GlyPro: 3.8 ± 0.567
GlyGln: 2.194 ± 0.529
GlyArg: 5.191 ± 0.671
GlySer: 5.994 ± 0.666
GlyThr: 6.583 ± 0.79
GlyVal: 6.315 ± 0.564
GlyTrp: 2.836 ± 0.423
GlyTyr: 1.659 ± 0.298
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.087 ± 0.34
HisCys: 0.268 ± 0.129
HisAsp: 0.803 ± 0.202
HisGlu: 0.803 ± 0.223
HisPhe: 0.589 ± 0.162
HisGly: 1.499 ± 0.28
HisHis: 0.696 ± 0.22
HisIle: 1.231 ± 0.257
HisLys: 0.749 ± 0.184
HisLeu: 1.606 ± 0.351
HisMet: 0.535 ± 0.156
HisAsn: 0.642 ± 0.18
HisPro: 1.284 ± 0.271
HisGln: 1.017 ± 0.267
HisArg: 2.408 ± 0.462
HisSer: 1.124 ± 0.253
HisThr: 1.713 ± 0.348
HisVal: 1.445 ± 0.33
HisTrp: 0.749 ± 0.179
HisTyr: 0.963 ± 0.226
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.763 ± 0.495
IleCys: 0.589 ± 0.194
IleAsp: 3.372 ± 0.429
IleGlu: 3.265 ± 0.414
IlePhe: 0.642 ± 0.207
IleGly: 3.907 ± 0.445
IleHis: 1.391 ± 0.26
IleIle: 1.499 ± 0.293
IleLys: 1.124 ± 0.243
IleLeu: 2.355 ± 0.41
IleMet: 0.482 ± 0.156
IleAsn: 1.82 ± 0.29
IlePro: 2.676 ± 0.328
IleGln: 1.606 ± 0.277
IleArg: 2.622 ± 0.432
IleSer: 2.729 ± 0.484
IleThr: 3.8 ± 0.417
IleVal: 2.676 ± 0.409
IleTrp: 0.856 ± 0.22
IleTyr: 0.803 ± 0.174
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.014 ± 0.516
LysCys: 0.321 ± 0.133
LysAsp: 1.766 ± 0.287
LysGlu: 1.552 ± 0.262
LysPhe: 1.445 ± 0.217
LysGly: 2.462 ± 0.326
LysHis: 0.963 ± 0.257
LysIle: 0.963 ± 0.203
LysLys: 1.445 ± 0.358
LysLeu: 2.462 ± 0.418
LysMet: 0.535 ± 0.161
LysAsn: 0.91 ± 0.24
LysPro: 2.087 ± 0.391
LysGln: 1.713 ± 0.252
LysArg: 2.355 ± 0.378
LysSer: 2.408 ± 0.485
LysThr: 2.087 ± 0.413
LysVal: 2.622 ± 0.427
LysTrp: 0.91 ± 0.242
LysTyr: 0.803 ± 0.22
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.707 ± 0.662
LeuCys: 0.696 ± 0.214
LeuAsp: 4.924 ± 0.527
LeuGlu: 4.174 ± 0.435
LeuPhe: 1.873 ± 0.28
LeuGly: 5.459 ± 0.549
LeuHis: 1.284 ± 0.264
LeuIle: 3.158 ± 0.401
LeuLys: 2.355 ± 0.392
LeuLeu: 4.603 ± 0.56
LeuMet: 1.284 ± 0.305
LeuAsn: 2.569 ± 0.378
LeuPro: 5.673 ± 0.687
LeuGln: 2.676 ± 0.445
LeuArg: 5.191 ± 0.549
LeuSer: 5.298 ± 0.547
LeuThr: 5.138 ± 0.487
LeuVal: 5.512 ± 0.484
LeuTrp: 1.231 ± 0.235
LeuTyr: 2.087 ± 0.358
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.301 ± 0.401
MetCys: 0.107 ± 0.081
MetAsp: 1.231 ± 0.26
MetGlu: 1.017 ± 0.191
MetPhe: 0.696 ± 0.224
MetGly: 1.713 ± 0.34
MetHis: 0.161 ± 0.099
MetIle: 0.642 ± 0.206
MetLys: 0.91 ± 0.23
MetLeu: 1.98 ± 0.26
MetMet: 0.482 ± 0.234
MetAsn: 0.856 ± 0.196
MetPro: 1.124 ± 0.234
MetGln: 0.482 ± 0.165
MetArg: 1.338 ± 0.251
MetSer: 2.944 ± 0.4
MetThr: 2.087 ± 0.323
MetVal: 1.284 ± 0.342
MetTrp: 0.321 ± 0.106
MetTyr: 0.428 ± 0.156
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.211 ± 0.361
AsnCys: 0.321 ± 0.134
AsnAsp: 1.713 ± 0.34
AsnGlu: 1.606 ± 0.3
AsnPhe: 0.696 ± 0.231
AsnGly: 3.853 ± 0.478
AsnHis: 0.856 ± 0.239
AsnIle: 1.606 ± 0.414
AsnLys: 1.124 ± 0.255
AsnLeu: 2.194 ± 0.325
AsnMet: 0.482 ± 0.138
AsnAsn: 1.659 ± 0.33
AsnPro: 2.515 ± 0.388
AsnGln: 1.177 ± 0.276
AsnArg: 1.499 ± 0.294
AsnSer: 1.766 ± 0.273
AsnThr: 2.087 ± 0.288
AsnVal: 2.087 ± 0.382
AsnTrp: 0.642 ± 0.154
AsnTyr: 0.91 ± 0.195
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.459 ± 0.585
ProCys: 0.696 ± 0.192
ProAsp: 4.228 ± 0.465
ProGlu: 4.87 ± 0.544
ProPhe: 1.659 ± 0.356
ProGly: 6.476 ± 0.567
ProHis: 1.338 ± 0.29
ProIle: 1.98 ± 0.332
ProLys: 2.355 ± 0.442
ProLeu: 4.014 ± 0.56
ProMet: 1.766 ± 0.336
ProAsn: 2.087 ± 0.287
ProPro: 3.746 ± 0.645
ProGln: 2.194 ± 0.431
ProArg: 3.479 ± 0.538
ProSer: 3.479 ± 0.362
ProThr: 3.372 ± 0.416
ProVal: 4.282 ± 0.428
ProTrp: 1.177 ± 0.233
ProTyr: 1.391 ± 0.298
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.442 ± 0.56
GlnCys: 0.375 ± 0.192
GlnAsp: 1.552 ± 0.252
GlnGlu: 1.606 ± 0.252
GlnPhe: 1.07 ± 0.215
GlnGly: 2.836 ± 0.438
GlnHis: 1.017 ± 0.249
GlnIle: 2.462 ± 0.378
GlnLys: 1.338 ± 0.302
GlnLeu: 2.783 ± 0.45
GlnMet: 0.803 ± 0.205
GlnAsn: 0.749 ± 0.248
GlnPro: 2.408 ± 0.392
GlnGln: 1.124 ± 0.276
GlnArg: 2.408 ± 0.326
GlnSer: 2.569 ± 0.381
GlnThr: 1.713 ± 0.318
GlnVal: 2.355 ± 0.296
GlnTrp: 0.696 ± 0.214
GlnTyr: 1.07 ± 0.268
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.529 ± 0.624
ArgCys: 0.856 ± 0.24
ArgAsp: 5.084 ± 0.602
ArgGlu: 5.191 ± 0.64
ArgPhe: 2.301 ± 0.455
ArgGly: 3.693 ± 0.424
ArgHis: 1.606 ± 0.319
ArgIle: 3.907 ± 0.512
ArgLys: 2.622 ± 0.364
ArgLeu: 5.298 ± 0.582
ArgMet: 2.676 ± 0.445
ArgAsn: 2.301 ± 0.367
ArgPro: 4.121 ± 0.461
ArgGln: 1.713 ± 0.384
ArgArg: 6.262 ± 0.946
ArgSer: 3.746 ± 0.396
ArgThr: 3.372 ± 0.515
ArgVal: 4.817 ± 0.548
ArgTrp: 1.98 ± 0.354
ArgTyr: 1.82 ± 0.294
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.386 ± 1.433
SerCys: 0.482 ± 0.167
SerAsp: 4.228 ± 0.6
SerGlu: 2.89 ± 0.373
SerPhe: 2.141 ± 0.36
SerGly: 6.583 ± 0.754
SerHis: 1.284 ± 0.312
SerIle: 3.051 ± 0.408
SerLys: 2.034 ± 0.349
SerLeu: 4.389 ± 0.59
SerMet: 1.177 ± 0.264
SerAsn: 2.194 ± 0.345
SerPro: 3.746 ± 0.409
SerGln: 1.927 ± 0.277
SerArg: 3.746 ± 0.453
SerSer: 4.067 ± 0.73
SerThr: 3.96 ± 0.436
SerVal: 4.496 ± 0.515
SerTrp: 1.07 ± 0.202
SerTyr: 1.552 ± 0.264
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.904 ± 0.701
ThrCys: 0.749 ± 0.275
ThrAsp: 4.014 ± 0.527
ThrGlu: 4.228 ± 0.51
ThrPhe: 2.087 ± 0.342
ThrGly: 6.262 ± 0.632
ThrHis: 1.659 ± 0.31
ThrIle: 3.265 ± 0.451
ThrLys: 2.248 ± 0.378
ThrLeu: 3.96 ± 0.48
ThrMet: 1.391 ± 0.286
ThrAsn: 2.141 ± 0.352
ThrPro: 4.335 ± 0.402
ThrGln: 1.82 ± 0.317
ThrArg: 4.174 ± 0.594
ThrSer: 3.586 ± 0.495
ThrThr: 4.603 ± 0.76
ThrVal: 5.834 ± 0.689
ThrTrp: 1.284 ± 0.316
ThrTyr: 1.338 ± 0.334
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.279 ± 0.482
ValCys: 1.338 ± 0.268
ValAsp: 5.298 ± 0.528
ValGlu: 4.496 ± 0.53
ValPhe: 2.301 ± 0.394
ValGly: 5.887 ± 0.622
ValHis: 1.284 ± 0.268
ValIle: 2.783 ± 0.436
ValLys: 2.622 ± 0.404
ValLeu: 5.191 ± 0.613
ValMet: 0.963 ± 0.184
ValAsn: 1.766 ± 0.327
ValPro: 4.121 ± 0.446
ValGln: 2.89 ± 0.375
ValArg: 4.603 ± 0.562
ValSer: 5.138 ± 0.62
ValThr: 4.817 ± 0.556
ValVal: 5.941 ± 0.702
ValTrp: 1.659 ± 0.369
ValTyr: 1.07 ± 0.242
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.873 ± 0.289
TrpCys: 0.054 ± 0.067
TrpAsp: 1.659 ± 0.293
TrpGlu: 1.07 ± 0.312
TrpPhe: 0.749 ± 0.21
TrpGly: 0.91 ± 0.225
TrpHis: 0.589 ± 0.179
TrpIle: 1.07 ± 0.215
TrpLys: 0.749 ± 0.181
TrpLeu: 1.82 ± 0.406
TrpMet: 0.91 ± 0.262
TrpAsn: 0.589 ± 0.224
TrpPro: 1.445 ± 0.327
TrpGln: 1.231 ± 0.277
TrpArg: 2.355 ± 0.413
TrpSer: 1.552 ± 0.475
TrpThr: 1.766 ± 0.342
TrpVal: 1.499 ± 0.37
TrpTrp: 0.91 ± 0.192
TrpTyr: 0.589 ± 0.187
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.569 ± 0.358
TyrCys: 0.268 ± 0.116
TyrAsp: 1.873 ± 0.404
TyrGlu: 1.552 ± 0.312
TyrPhe: 0.642 ± 0.177
TyrGly: 1.98 ± 0.397
TyrHis: 0.428 ± 0.125
TyrIle: 1.124 ± 0.213
TyrLys: 0.535 ± 0.15
TyrLeu: 1.766 ± 0.348
TyrMet: 0.107 ± 0.066
TyrAsn: 0.91 ± 0.218
TyrPro: 1.231 ± 0.217
TyrGln: 0.803 ± 0.167
TyrArg: 2.355 ± 0.376
TyrSer: 1.124 ± 0.246
TyrThr: 1.873 ± 0.342
TyrVal: 2.408 ± 0.329
TyrTrp: 0.642 ± 0.21
TyrTyr: 0.482 ± 0.148
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 110 proteins (18686 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
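One way to picture a protein-level bootstrap is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported. This is only a sketch under stated assumptions: the source does not say which spread measure is used (standard deviation is assumed here), and the function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the reported error is a standard deviation; the source
    only states that bootstrapping (x100) was done at the protein level.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome (one-letter codes), not real phage sequences.
toy = ["MAAGLAA", "MKRAAV", "MGGSAT", "MLLPAA"]
print(pair_frequency(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects how much the estimate depends on which proteins happen to be in the proteome.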

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski