Amino acid dipepetide frequency for Shigella virus 2019SD1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.785AlaAla: 6.785 ± 0.968
0.822AlaCys: 0.822 ± 0.261
3.358AlaAsp: 3.358 ± 0.696
4.524AlaGlu: 4.524 ± 0.521
3.427AlaPhe: 3.427 ± 0.557
4.455AlaGly: 4.455 ± 0.568
1.371AlaHis: 1.371 ± 0.368
6.169AlaIle: 6.169 ± 0.71
6.237AlaLys: 6.237 ± 0.774
6.443AlaLeu: 6.443 ± 0.812
3.016AlaMet: 3.016 ± 0.562
3.633AlaAsn: 3.633 ± 0.636
2.673AlaPro: 2.673 ± 0.404
3.29AlaGln: 3.29 ± 0.538
4.455AlaArg: 4.455 ± 0.595
4.866AlaSer: 4.866 ± 0.534
4.729AlaThr: 4.729 ± 0.928
6.032AlaVal: 6.032 ± 0.783
1.097AlaTrp: 1.097 ± 0.284
2.193AlaTyr: 2.193 ± 0.421
0.0AlaXaa: 0.0 ± 0.0
Cys
1.371CysAla: 1.371 ± 0.359
0.137CysCys: 0.137 ± 0.089
1.028CysAsp: 1.028 ± 0.268
0.685CysGlu: 0.685 ± 0.227
0.411CysPhe: 0.411 ± 0.23
1.165CysGly: 1.165 ± 0.357
0.411CysHis: 0.411 ± 0.16
0.822CysIle: 0.822 ± 0.26
1.165CysLys: 1.165 ± 0.404
0.891CysLeu: 0.891 ± 0.279
0.548CysMet: 0.548 ± 0.174
0.343CysAsn: 0.343 ± 0.182
0.069CysPro: 0.069 ± 0.063
0.274CysGln: 0.274 ± 0.12
1.302CysArg: 1.302 ± 0.276
0.411CysSer: 0.411 ± 0.162
0.754CysThr: 0.754 ± 0.263
0.891CysVal: 0.891 ± 0.262
0.617CysTrp: 0.617 ± 0.204
0.274CysTyr: 0.274 ± 0.137
0.0CysXaa: 0.0 ± 0.0
Asp
4.661AspAla: 4.661 ± 0.526
0.548AspCys: 0.548 ± 0.226
3.77AspAsp: 3.77 ± 0.556
3.77AspGlu: 3.77 ± 0.585
2.193AspPhe: 2.193 ± 0.433
6.306AspGly: 6.306 ± 0.764
1.097AspHis: 1.097 ± 0.305
3.016AspIle: 3.016 ± 0.487
3.838AspLys: 3.838 ± 0.6
4.866AspLeu: 4.866 ± 0.678
1.919AspMet: 1.919 ± 0.365
2.605AspAsn: 2.605 ± 0.535
2.193AspPro: 2.193 ± 0.365
2.33AspGln: 2.33 ± 0.39
2.536AspArg: 2.536 ± 0.466
3.907AspSer: 3.907 ± 0.673
3.221AspThr: 3.221 ± 0.526
2.81AspVal: 2.81 ± 0.469
0.754AspTrp: 0.754 ± 0.304
1.851AspTyr: 1.851 ± 0.364
0.0AspXaa: 0.0 ± 0.0
Glu
4.387GluAla: 4.387 ± 0.575
0.822GluCys: 0.822 ± 0.245
2.673GluAsp: 2.673 ± 0.449
3.838GluGlu: 3.838 ± 0.607
3.358GluPhe: 3.358 ± 0.575
3.29GluGly: 3.29 ± 0.555
0.685GluHis: 0.685 ± 0.297
4.935GluIle: 4.935 ± 0.591
4.249GluLys: 4.249 ± 0.675
3.701GluLeu: 3.701 ± 0.703
2.33GluMet: 2.33 ± 0.38
3.427GluAsn: 3.427 ± 0.54
1.371GluPro: 1.371 ± 0.377
3.016GluGln: 3.016 ± 0.475
2.673GluArg: 2.673 ± 0.585
4.387GluSer: 4.387 ± 0.572
3.907GluThr: 3.907 ± 0.726
4.524GluVal: 4.524 ± 0.499
0.48GluTrp: 0.48 ± 0.162
2.056GluTyr: 2.056 ± 0.401
0.0GluXaa: 0.0 ± 0.0
Phe
2.056PheAla: 2.056 ± 0.38
0.548PheCys: 0.548 ± 0.221
2.879PheAsp: 2.879 ± 0.484
1.508PheGlu: 1.508 ± 0.383
1.714PhePhe: 1.714 ± 0.407
2.81PheGly: 2.81 ± 0.622
0.822PheHis: 0.822 ± 0.29
2.193PheIle: 2.193 ± 0.35
3.016PheLys: 3.016 ± 0.478
2.399PheLeu: 2.399 ± 0.401
0.822PheMet: 0.822 ± 0.21
2.125PheAsn: 2.125 ± 0.366
1.028PhePro: 1.028 ± 0.263
1.439PheGln: 1.439 ± 0.414
2.467PheArg: 2.467 ± 0.46
1.988PheSer: 1.988 ± 0.459
2.879PheThr: 2.879 ± 0.603
2.947PheVal: 2.947 ± 0.457
0.891PheTrp: 0.891 ± 0.263
1.508PheTyr: 1.508 ± 0.467
0.0PheXaa: 0.0 ± 0.0
Gly
5.552GlyAla: 5.552 ± 0.954
0.822GlyCys: 0.822 ± 0.213
5.141GlyAsp: 5.141 ± 1.005
4.044GlyGlu: 4.044 ± 0.595
1.714GlyPhe: 1.714 ± 0.443
6.032GlyGly: 6.032 ± 1.357
1.028GlyHis: 1.028 ± 0.226
4.661GlyIle: 4.661 ± 0.471
5.552GlyLys: 5.552 ± 0.687
5.003GlyLeu: 5.003 ± 0.556
2.262GlyMet: 2.262 ± 0.41
3.29GlyAsn: 3.29 ± 0.662
0.685GlyPro: 0.685 ± 0.291
1.645GlyGln: 1.645 ± 0.378
3.221GlyArg: 3.221 ± 0.434
5.278GlySer: 5.278 ± 0.799
3.907GlyThr: 3.907 ± 0.581
6.306GlyVal: 6.306 ± 0.636
1.302GlyTrp: 1.302 ± 0.418
2.605GlyTyr: 2.605 ± 0.389
0.0GlyXaa: 0.0 ± 0.0
His
1.165HisAla: 1.165 ± 0.392
0.274HisCys: 0.274 ± 0.17
0.96HisAsp: 0.96 ± 0.275
1.165HisGlu: 1.165 ± 0.32
0.48HisPhe: 0.48 ± 0.209
1.714HisGly: 1.714 ± 0.425
0.617HisHis: 0.617 ± 0.245
1.576HisIle: 1.576 ± 0.383
1.234HisLys: 1.234 ± 0.376
1.302HisLeu: 1.302 ± 0.375
0.343HisMet: 0.343 ± 0.148
1.371HisAsn: 1.371 ± 0.403
0.754HisPro: 0.754 ± 0.261
0.754HisGln: 0.754 ± 0.324
1.097HisArg: 1.097 ± 0.298
0.96HisSer: 0.96 ± 0.294
1.028HisThr: 1.028 ± 0.372
1.576HisVal: 1.576 ± 0.359
0.069HisTrp: 0.069 ± 0.062
1.234HisTyr: 1.234 ± 0.277
0.0HisXaa: 0.0 ± 0.0
Ile
5.072IleAla: 5.072 ± 0.616
0.617IleCys: 0.617 ± 0.255
5.483IleAsp: 5.483 ± 0.615
4.044IleGlu: 4.044 ± 0.44
1.988IlePhe: 1.988 ± 0.38
4.387IleGly: 4.387 ± 0.608
1.371IleHis: 1.371 ± 0.33
4.249IleIle: 4.249 ± 0.569
4.935IleLys: 4.935 ± 0.704
3.221IleLeu: 3.221 ± 0.421
2.193IleMet: 2.193 ± 0.499
2.81IleAsn: 2.81 ± 0.603
1.988IlePro: 1.988 ± 0.399
2.399IleGln: 2.399 ± 0.446
4.318IleArg: 4.318 ± 0.613
4.661IleSer: 4.661 ± 0.659
4.318IleThr: 4.318 ± 0.665
3.564IleVal: 3.564 ± 0.55
1.234IleTrp: 1.234 ± 0.333
2.262IleTyr: 2.262 ± 0.469
0.0IleXaa: 0.0 ± 0.0
Lys
7.128LysAla: 7.128 ± 0.909
0.617LysCys: 0.617 ± 0.229
4.181LysAsp: 4.181 ± 0.659
5.278LysGlu: 5.278 ± 0.679
2.673LysPhe: 2.673 ± 0.48
4.181LysGly: 4.181 ± 0.617
1.508LysHis: 1.508 ± 0.389
3.016LysIle: 3.016 ± 0.582
3.016LysLys: 3.016 ± 0.466
5.483LysLeu: 5.483 ± 0.58
3.016LysMet: 3.016 ± 0.739
2.81LysAsn: 2.81 ± 0.621
3.564LysPro: 3.564 ± 0.521
3.633LysGln: 3.633 ± 0.777
4.935LysArg: 4.935 ± 0.877
3.975LysSer: 3.975 ± 0.602
4.661LysThr: 4.661 ± 0.666
4.318LysVal: 4.318 ± 0.679
1.028LysTrp: 1.028 ± 0.257
1.508LysTyr: 1.508 ± 0.444
0.0LysXaa: 0.0 ± 0.0
Leu
6.1LeuAla: 6.1 ± 0.717
1.028LeuCys: 1.028 ± 0.304
3.084LeuAsp: 3.084 ± 0.461
3.564LeuGlu: 3.564 ± 0.544
2.056LeuPhe: 2.056 ± 0.354
3.29LeuGly: 3.29 ± 0.659
1.988LeuHis: 1.988 ± 0.418
5.141LeuIle: 5.141 ± 0.719
5.552LeuLys: 5.552 ± 0.642
6.374LeuLeu: 6.374 ± 0.812
2.879LeuMet: 2.879 ± 0.546
3.153LeuAsn: 3.153 ± 0.73
2.947LeuPro: 2.947 ± 0.519
2.33LeuGln: 2.33 ± 0.461
5.278LeuArg: 5.278 ± 0.616
6.374LeuSer: 6.374 ± 0.722
5.483LeuThr: 5.483 ± 0.93
3.358LeuVal: 3.358 ± 0.523
0.96LeuTrp: 0.96 ± 0.278
2.33LeuTyr: 2.33 ± 0.493
0.0LeuXaa: 0.0 ± 0.0
Met
3.084MetAla: 3.084 ± 0.4
0.343MetCys: 0.343 ± 0.152
1.714MetAsp: 1.714 ± 0.35
1.782MetGlu: 1.782 ± 0.504
1.165MetPhe: 1.165 ± 0.27
1.439MetGly: 1.439 ± 0.38
0.754MetHis: 0.754 ± 0.286
2.81MetIle: 2.81 ± 0.519
2.742MetLys: 2.742 ± 0.509
2.879MetLeu: 2.879 ± 0.476
1.302MetMet: 1.302 ± 0.328
1.165MetAsn: 1.165 ± 0.265
1.576MetPro: 1.576 ± 0.279
1.782MetGln: 1.782 ± 0.319
2.399MetArg: 2.399 ± 0.481
1.919MetSer: 1.919 ± 0.461
1.645MetThr: 1.645 ± 0.389
1.782MetVal: 1.782 ± 0.274
0.206MetTrp: 0.206 ± 0.119
0.617MetTyr: 0.617 ± 0.187
0.0MetXaa: 0.0 ± 0.0
Asn
4.112AsnAla: 4.112 ± 0.459
0.822AsnCys: 0.822 ± 0.301
2.81AsnAsp: 2.81 ± 0.399
2.262AsnGlu: 2.262 ± 0.437
1.919AsnPhe: 1.919 ± 0.34
4.798AsnGly: 4.798 ± 0.798
0.891AsnHis: 0.891 ± 0.323
3.084AsnIle: 3.084 ± 0.541
3.29AsnLys: 3.29 ± 0.521
2.673AsnLeu: 2.673 ± 0.614
1.371AsnMet: 1.371 ± 0.319
2.536AsnAsn: 2.536 ± 0.565
1.851AsnPro: 1.851 ± 0.37
2.467AsnGln: 2.467 ± 0.601
2.467AsnArg: 2.467 ± 0.443
2.947AsnSer: 2.947 ± 0.421
1.988AsnThr: 1.988 ± 0.381
2.262AsnVal: 2.262 ± 0.402
0.891AsnTrp: 0.891 ± 0.285
1.508AsnTyr: 1.508 ± 0.313
0.0AsnXaa: 0.0 ± 0.0
Pro
2.879ProAla: 2.879 ± 0.417
0.343ProCys: 0.343 ± 0.143
2.742ProAsp: 2.742 ± 0.596
2.947ProGlu: 2.947 ± 0.597
1.508ProPhe: 1.508 ± 0.477
1.302ProGly: 1.302 ± 0.368
1.028ProHis: 1.028 ± 0.282
1.782ProIle: 1.782 ± 0.429
1.028ProLys: 1.028 ± 0.272
2.947ProLeu: 2.947 ± 0.534
1.097ProMet: 1.097 ± 0.32
1.576ProAsn: 1.576 ± 0.425
1.097ProPro: 1.097 ± 0.38
1.097ProGln: 1.097 ± 0.281
1.919ProArg: 1.919 ± 0.488
1.645ProSer: 1.645 ± 0.37
1.714ProThr: 1.714 ± 0.348
2.536ProVal: 2.536 ± 0.493
0.343ProTrp: 0.343 ± 0.165
1.919ProTyr: 1.919 ± 0.57
0.0ProXaa: 0.0 ± 0.0
Gln
2.467GlnAla: 2.467 ± 0.417
0.891GlnCys: 0.891 ± 0.233
1.714GlnAsp: 1.714 ± 0.394
2.33GlnGlu: 2.33 ± 0.4
1.714GlnPhe: 1.714 ± 0.337
1.988GlnGly: 1.988 ± 0.442
0.617GlnHis: 0.617 ± 0.244
3.084GlnIle: 3.084 ± 0.718
2.742GlnLys: 2.742 ± 0.415
3.221GlnLeu: 3.221 ± 0.704
1.302GlnMet: 1.302 ± 0.329
1.576GlnAsn: 1.576 ± 0.367
1.439GlnPro: 1.439 ± 0.402
2.879GlnGln: 2.879 ± 0.603
2.467GlnArg: 2.467 ± 0.382
3.564GlnSer: 3.564 ± 0.533
1.782GlnThr: 1.782 ± 0.385
2.879GlnVal: 2.879 ± 0.602
0.685GlnTrp: 0.685 ± 0.226
1.508GlnTyr: 1.508 ± 0.383
0.0GlnXaa: 0.0 ± 0.0
Arg
4.729ArgAla: 4.729 ± 0.793
1.371ArgCys: 1.371 ± 0.304
3.016ArgAsp: 3.016 ± 0.453
2.605ArgGlu: 2.605 ± 0.343
1.851ArgPhe: 1.851 ± 0.433
3.29ArgGly: 3.29 ± 0.48
0.754ArgHis: 0.754 ± 0.229
4.524ArgIle: 4.524 ± 0.576
5.346ArgLys: 5.346 ± 0.681
4.249ArgLeu: 4.249 ± 0.716
2.193ArgMet: 2.193 ± 0.382
2.262ArgAsn: 2.262 ± 0.504
1.851ArgPro: 1.851 ± 0.365
2.399ArgGln: 2.399 ± 0.553
4.729ArgArg: 4.729 ± 0.546
4.249ArgSer: 4.249 ± 0.676
2.056ArgThr: 2.056 ± 0.397
4.935ArgVal: 4.935 ± 0.979
1.097ArgTrp: 1.097 ± 0.28
2.81ArgTyr: 2.81 ± 0.432
0.0ArgXaa: 0.0 ± 0.0
Ser
5.963SerAla: 5.963 ± 0.547
0.822SerCys: 0.822 ± 0.303
3.838SerAsp: 3.838 ± 0.422
5.415SerGlu: 5.415 ± 0.675
3.221SerPhe: 3.221 ± 0.416
6.169SerGly: 6.169 ± 0.627
1.508SerHis: 1.508 ± 0.343
3.701SerIle: 3.701 ± 0.515
3.975SerLys: 3.975 ± 0.567
4.249SerLeu: 4.249 ± 0.684
1.714SerMet: 1.714 ± 0.371
2.879SerAsn: 2.879 ± 0.528
2.399SerPro: 2.399 ± 0.466
1.919SerGln: 1.919 ± 0.528
4.318SerArg: 4.318 ± 0.648
3.084SerSer: 3.084 ± 0.518
3.77SerThr: 3.77 ± 0.542
3.838SerVal: 3.838 ± 0.569
1.097SerTrp: 1.097 ± 0.285
2.056SerTyr: 2.056 ± 0.478
0.0SerXaa: 0.0 ± 0.0
Thr
5.072ThrAla: 5.072 ± 0.63
0.822ThrCys: 0.822 ± 0.25
2.947ThrAsp: 2.947 ± 0.501
3.153ThrGlu: 3.153 ± 0.574
2.605ThrPhe: 2.605 ± 0.37
6.511ThrGly: 6.511 ± 0.836
0.754ThrHis: 0.754 ± 0.223
3.358ThrIle: 3.358 ± 0.602
3.221ThrLys: 3.221 ± 0.743
4.592ThrLeu: 4.592 ± 0.534
1.165ThrMet: 1.165 ± 0.267
3.496ThrAsn: 3.496 ± 0.596
2.399ThrPro: 2.399 ± 0.437
3.221ThrGln: 3.221 ± 0.657
2.81ThrArg: 2.81 ± 0.577
3.701ThrSer: 3.701 ± 0.709
3.221ThrThr: 3.221 ± 0.576
3.358ThrVal: 3.358 ± 0.509
0.411ThrTrp: 0.411 ± 0.183
1.645ThrTyr: 1.645 ± 0.352
0.0ThrXaa: 0.0 ± 0.0
Val
4.044ValAla: 4.044 ± 0.71
1.371ValCys: 1.371 ± 0.336
4.044ValAsp: 4.044 ± 0.49
4.249ValGlu: 4.249 ± 0.641
2.056ValPhe: 2.056 ± 0.44
4.044ValGly: 4.044 ± 0.529
1.028ValHis: 1.028 ± 0.446
4.112ValIle: 4.112 ± 0.725
6.237ValLys: 6.237 ± 0.823
4.935ValLeu: 4.935 ± 0.46
2.262ValMet: 2.262 ± 0.426
3.701ValAsn: 3.701 ± 0.558
1.714ValPro: 1.714 ± 0.447
1.302ValGln: 1.302 ± 0.443
2.81ValArg: 2.81 ± 0.524
4.661ValSer: 4.661 ± 0.497
4.524ValThr: 4.524 ± 0.85
4.387ValVal: 4.387 ± 0.597
0.891ValTrp: 0.891 ± 0.237
2.262ValTyr: 2.262 ± 0.366
0.0ValXaa: 0.0 ± 0.0
Trp
0.685TrpAla: 0.685 ± 0.185
0.274TrpCys: 0.274 ± 0.168
0.754TrpAsp: 0.754 ± 0.252
0.343TrpGlu: 0.343 ± 0.157
0.685TrpPhe: 0.685 ± 0.287
0.754TrpGly: 0.754 ± 0.22
0.685TrpHis: 0.685 ± 0.25
1.028TrpIle: 1.028 ± 0.265
0.822TrpLys: 0.822 ± 0.302
1.576TrpLeu: 1.576 ± 0.357
0.411TrpMet: 0.411 ± 0.155
0.685TrpAsn: 0.685 ± 0.255
0.274TrpPro: 0.274 ± 0.123
1.165TrpGln: 1.165 ± 0.383
1.714TrpArg: 1.714 ± 0.363
1.028TrpSer: 1.028 ± 0.263
0.891TrpThr: 0.891 ± 0.245
0.48TrpVal: 0.48 ± 0.146
0.274TrpTrp: 0.274 ± 0.145
0.274TrpTyr: 0.274 ± 0.138
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.262TyrAla: 2.262 ± 0.669
0.274TyrCys: 0.274 ± 0.116
2.125TyrAsp: 2.125 ± 0.41
2.605TyrGlu: 2.605 ± 0.507
1.302TyrPhe: 1.302 ± 0.327
2.125TyrGly: 2.125 ± 0.468
0.754TyrHis: 0.754 ± 0.189
1.851TyrIle: 1.851 ± 0.354
2.33TyrLys: 2.33 ± 0.456
2.262TyrLeu: 2.262 ± 0.449
0.891TyrMet: 0.891 ± 0.271
1.508TyrAsn: 1.508 ± 0.339
1.576TyrPro: 1.576 ± 0.428
1.508TyrGln: 1.508 ± 0.338
2.399TyrArg: 2.399 ± 0.457
2.33TyrSer: 2.33 ± 0.399
1.988TyrThr: 1.988 ± 0.32
1.851TyrVal: 1.851 ± 0.386
0.48TyrTrp: 0.48 ± 0.197
0.96TyrTyr: 0.96 ± 0.244
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 76 proteins (14591 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski