Amino acid dipeptide frequency for Mapuera orthorubulavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
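The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the database: the function names, the one-letter alphabet, and the choice to count overlapping dipeptides within each protein (never across protein boundaries) are assumptions made for this example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"          # X stands for Xaa (non-standard or unknown)

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
EXPECTED_UNIFORM = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: overlapping windows are counted inside each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Map every letter outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille, expected=EXPECTED_UNIFORM):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"
```

With these definitions, `classify(4.219)` (AlaAla in the table) gives `"over"` and `classify(1.726)` (AlaCys) gives `"under"`, which corresponds to the red and blue marking described above.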

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.
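As a small worked example of the unit conversion, the helpers below turn a per mille value into a plain fraction or a percentage. The function names are hypothetical and exist only for this illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0
```

For instance, the AlaAla value of 4.219‰ corresponds to a fraction of about 0.004219, and the uniform expectation of 2.5‰ corresponds to 0.25%.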

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.219 ± 1.579
AlaCys: 1.726 ± 0.456
AlaAsp: 2.301 ± 0.544
AlaGlu: 3.644 ± 0.753
AlaPhe: 2.493 ± 0.453
AlaGly: 1.918 ± 0.421
AlaHis: 1.918 ± 0.603
AlaIle: 4.603 ± 1.138
AlaLys: 4.603 ± 2.104
AlaLeu: 7.48 ± 0.839
AlaMet: 1.534 ± 0.63
AlaAsn: 1.918 ± 0.349
AlaPro: 2.877 ± 1.412
AlaGln: 5.178 ± 1.436
AlaArg: 3.26 ± 1.053
AlaSer: 4.987 ± 1.22
AlaThr: 4.411 ± 1.193
AlaVal: 2.685 ± 0.986
AlaTrp: 0.959 ± 0.455
AlaTyr: 2.493 ± 0.779
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.534 ± 0.406
CysCys: 0.384 ± 0.243
CysAsp: 0.192 ± 0.241
CysGlu: 0.959 ± 0.315
CysPhe: 1.151 ± 0.49
CysGly: 0.575 ± 0.325
CysHis: 0.192 ± 0.213
CysIle: 0.959 ± 0.608
CysLys: 1.343 ± 0.307
CysLeu: 1.151 ± 0.23
CysMet: 0.384 ± 0.317
CysAsn: 0.959 ± 0.332
CysPro: 1.343 ± 0.657
CysGln: 0.959 ± 0.317
CysArg: 0.384 ± 0.333
CysSer: 1.918 ± 0.506
CysThr: 1.534 ± 0.492
CysVal: 1.918 ± 0.591
CysTrp: 0.0 ± 0.0
CysTyr: 0.767 ± 0.327
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.452 ± 1.245
AspCys: 0.767 ± 0.37
AspAsp: 3.836 ± 0.593
AspGlu: 2.877 ± 1.013
AspPhe: 1.343 ± 0.684
AspGly: 2.493 ± 0.552
AspHis: 0.767 ± 0.349
AspIle: 2.685 ± 0.457
AspLys: 1.534 ± 0.495
AspLeu: 6.329 ± 0.815
AspMet: 0.767 ± 0.345
AspAsn: 1.726 ± 1.094
AspPro: 4.987 ± 1.223
AspGln: 3.644 ± 0.659
AspArg: 1.918 ± 0.782
AspSer: 7.48 ± 1.033
AspThr: 1.343 ± 0.51
AspVal: 2.877 ± 1.037
AspTrp: 0.767 ± 0.37
AspTyr: 2.301 ± 0.463
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.301 ± 0.604
GluCys: 0.767 ± 0.328
GluAsp: 1.918 ± 0.523
GluGlu: 2.493 ± 0.486
GluPhe: 1.343 ± 0.539
GluGly: 3.069 ± 0.517
GluHis: 0.959 ± 0.603
GluIle: 5.178 ± 0.903
GluLys: 1.918 ± 0.622
GluLeu: 5.178 ± 0.924
GluMet: 1.151 ± 0.341
GluAsn: 2.877 ± 0.558
GluPro: 2.685 ± 0.865
GluGln: 0.959 ± 0.709
GluArg: 2.685 ± 1.142
GluSer: 4.028 ± 1.284
GluThr: 3.069 ± 0.593
GluVal: 3.644 ± 0.837
GluTrp: 0.959 ± 0.492
GluTyr: 1.918 ± 0.415
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.11 ± 0.466
PheCys: 1.151 ± 0.409
PheAsp: 0.959 ± 0.608
PheGlu: 2.301 ± 0.365
PhePhe: 2.493 ± 0.783
PheGly: 1.151 ± 0.454
PheHis: 0.384 ± 0.206
PheIle: 2.685 ± 0.58
PheLys: 0.959 ± 0.4
PheLeu: 3.26 ± 0.746
PheMet: 0.959 ± 0.442
PheAsn: 1.534 ± 0.614
PhePro: 1.534 ± 0.672
PheGln: 0.767 ± 0.682
PheArg: 1.151 ± 0.419
PheSer: 5.37 ± 1.067
PheThr: 1.918 ± 0.318
PheVal: 1.918 ± 0.634
PheTrp: 0.0 ± 0.0
PheTyr: 1.343 ± 0.565
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.452 ± 1.519
GlyCys: 1.726 ± 0.869
GlyAsp: 3.644 ± 0.949
GlyGlu: 1.534 ± 0.498
GlyPhe: 1.343 ± 0.414
GlyGly: 3.069 ± 1.139
GlyHis: 1.151 ± 0.579
GlyIle: 3.26 ± 0.746
GlyLys: 2.877 ± 1.004
GlyLeu: 4.028 ± 0.66
GlyMet: 1.343 ± 0.456
GlyAsn: 0.767 ± 0.247
GlyPro: 3.069 ± 1.208
GlyGln: 1.151 ± 0.483
GlyArg: 3.26 ± 0.634
GlySer: 4.603 ± 1.545
GlyThr: 3.836 ± 1.447
GlyVal: 3.26 ± 1.048
GlyTrp: 0.959 ± 0.523
GlyTyr: 1.726 ± 0.497
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.726 ± 0.645
HisCys: 0.575 ± 0.274
HisAsp: 0.767 ± 0.486
HisGlu: 1.726 ± 0.313
HisPhe: 0.0 ± 0.0
HisGly: 1.343 ± 0.756
HisHis: 0.384 ± 0.243
HisIle: 0.959 ± 0.31
HisLys: 0.767 ± 0.394
HisLeu: 3.644 ± 1.262
HisMet: 0.192 ± 0.237
HisAsn: 0.384 ± 0.207
HisPro: 2.301 ± 0.59
HisGln: 0.575 ± 0.39
HisArg: 1.151 ± 0.431
HisSer: 0.959 ± 0.626
HisThr: 1.918 ± 0.825
HisVal: 0.959 ± 0.293
HisTrp: 0.384 ± 0.342
HisTyr: 0.384 ± 0.207
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.836 ± 0.928
IleCys: 0.575 ± 0.567
IleAsp: 3.836 ± 0.919
IleGlu: 4.795 ± 0.807
IlePhe: 2.11 ± 0.609
IleGly: 2.877 ± 0.664
IleHis: 1.534 ± 0.699
IleIle: 5.754 ± 1.57
IleLys: 3.836 ± 1.047
IleLeu: 7.672 ± 0.925
IleMet: 2.301 ± 0.783
IleAsn: 2.877 ± 0.677
IlePro: 4.028 ± 0.895
IleGln: 4.795 ± 1.45
IleArg: 1.918 ± 0.445
IleSer: 7.672 ± 1.144
IleThr: 5.946 ± 0.838
IleVal: 4.219 ± 0.931
IleTrp: 1.151 ± 0.553
IleTyr: 1.726 ± 0.36
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.301 ± 0.481
LysCys: 0.575 ± 0.262
LysAsp: 2.301 ± 0.463
LysGlu: 4.219 ± 0.617
LysPhe: 0.959 ± 0.455
LysGly: 1.918 ± 0.517
LysHis: 1.343 ± 0.491
LysIle: 3.644 ± 1.324
LysLys: 0.959 ± 0.262
LysLeu: 4.219 ± 1.431
LysMet: 1.343 ± 0.971
LysAsn: 0.959 ± 0.278
LysPro: 1.343 ± 0.641
LysGln: 2.685 ± 0.605
LysArg: 2.685 ± 0.64
LysSer: 4.795 ± 1.044
LysThr: 2.685 ± 0.356
LysVal: 3.069 ± 0.721
LysTrp: 0.192 ± 0.122
LysTyr: 2.685 ± 1.03
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.439 ± 0.763
LeuCys: 1.726 ± 0.612
LeuAsp: 6.137 ± 0.975
LeuGlu: 4.987 ± 1.161
LeuPhe: 2.685 ± 0.532
LeuGly: 5.562 ± 0.662
LeuHis: 1.343 ± 0.528
LeuIle: 6.521 ± 1.036
LeuLys: 5.946 ± 1.582
LeuLeu: 9.59 ± 2.152
LeuMet: 1.918 ± 0.42
LeuAsn: 4.219 ± 0.828
LeuPro: 4.028 ± 0.769
LeuGln: 4.411 ± 1.564
LeuArg: 5.37 ± 1.457
LeuSer: 9.014 ± 1.41
LeuThr: 9.59 ± 1.913
LeuVal: 4.411 ± 0.627
LeuTrp: 1.151 ± 0.538
LeuTyr: 3.452 ± 0.646
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.301 ± 0.828
MetCys: 1.151 ± 0.397
MetAsp: 1.534 ± 1.086
MetGlu: 1.343 ± 0.476
MetPhe: 0.192 ± 0.241
MetGly: 0.959 ± 0.478
MetHis: 0.192 ± 0.122
MetIle: 1.534 ± 0.667
MetLys: 1.343 ± 0.601
MetLeu: 1.343 ± 0.378
MetMet: 1.151 ± 0.88
MetAsn: 1.918 ± 0.725
MetPro: 1.151 ± 0.363
MetGln: 1.151 ± 0.424
MetArg: 1.534 ± 0.409
MetSer: 1.918 ± 0.663
MetThr: 1.726 ± 0.618
MetVal: 0.959 ± 0.426
MetTrp: 0.192 ± 0.213
MetTyr: 0.767 ± 0.325
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.26 ± 0.175
AsnCys: 0.767 ± 0.304
AsnAsp: 2.877 ± 0.97
AsnGlu: 1.151 ± 0.439
AsnPhe: 0.959 ± 0.3
AsnGly: 2.301 ± 0.475
AsnHis: 0.767 ± 0.344
AsnIle: 2.877 ± 0.583
AsnLys: 0.384 ± 0.326
AsnLeu: 2.685 ± 0.942
AsnMet: 0.192 ± 0.235
AsnAsn: 2.301 ± 0.699
AsnPro: 4.219 ± 0.89
AsnGln: 2.877 ± 0.972
AsnArg: 1.726 ± 0.324
AsnSer: 3.452 ± 0.676
AsnThr: 1.534 ± 0.403
AsnVal: 1.726 ± 0.372
AsnTrp: 0.959 ± 0.31
AsnTyr: 2.11 ± 0.665
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.26 ± 1.558
ProCys: 0.0 ± 0.0
ProAsp: 3.644 ± 1.099
ProGlu: 3.452 ± 0.88
ProPhe: 2.11 ± 0.42
ProGly: 3.452 ± 1.116
ProHis: 1.151 ± 0.525
ProIle: 4.603 ± 1.166
ProLys: 2.685 ± 1.134
ProLeu: 5.562 ± 0.683
ProMet: 1.151 ± 0.533
ProAsn: 1.918 ± 0.621
ProPro: 5.178 ± 1.144
ProGln: 3.069 ± 0.946
ProArg: 3.644 ± 0.743
ProSer: 4.411 ± 1.036
ProThr: 4.795 ± 1.616
ProVal: 3.452 ± 1.126
ProTrp: 0.384 ± 0.243
ProTyr: 1.343 ± 0.324
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.028 ± 1.114
GlnCys: 0.767 ± 0.412
GlnAsp: 1.534 ± 0.556
GlnGlu: 2.11 ± 0.653
GlnPhe: 2.877 ± 0.655
GlnGly: 3.452 ± 1.283
GlnHis: 0.575 ± 0.426
GlnIle: 4.603 ± 0.952
GlnLys: 1.918 ± 0.432
GlnLeu: 5.754 ± 1.417
GlnMet: 1.534 ± 0.578
GlnAsn: 3.26 ± 1.132
GlnPro: 2.301 ± 0.723
GlnGln: 2.301 ± 0.938
GlnArg: 1.726 ± 0.845
GlnSer: 4.028 ± 1.235
GlnThr: 2.685 ± 0.49
GlnVal: 4.028 ± 1.052
GlnTrp: 0.384 ± 0.206
GlnTyr: 1.534 ± 0.666
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.11 ± 0.424
ArgCys: 0.384 ± 0.4
ArgAsp: 1.918 ± 0.667
ArgGlu: 1.726 ± 0.448
ArgPhe: 1.918 ± 0.64
ArgGly: 2.493 ± 1.064
ArgHis: 1.918 ± 0.478
ArgIle: 4.219 ± 0.901
ArgLys: 3.069 ± 0.469
ArgLeu: 5.37 ± 1.803
ArgMet: 0.959 ± 0.615
ArgAsn: 0.959 ± 0.322
ArgPro: 2.301 ± 0.524
ArgGln: 1.534 ± 0.709
ArgArg: 5.37 ± 1.301
ArgSer: 4.987 ± 0.65
ArgThr: 1.918 ± 0.594
ArgVal: 4.028 ± 1.111
ArgTrp: 1.534 ± 0.607
ArgTyr: 1.726 ± 0.705
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.946 ± 1.059
SerCys: 3.069 ± 0.727
SerAsp: 6.137 ± 1.079
SerGlu: 3.644 ± 0.527
SerPhe: 3.069 ± 0.695
SerGly: 4.411 ± 1.173
SerHis: 2.877 ± 0.71
SerIle: 5.37 ± 0.937
SerLys: 3.836 ± 0.912
SerLeu: 7.863 ± 0.979
SerMet: 2.685 ± 0.508
SerAsn: 3.452 ± 1.259
SerPro: 4.987 ± 1.264
SerGln: 4.987 ± 1.039
SerArg: 5.946 ± 0.973
SerSer: 9.014 ± 0.729
SerThr: 4.987 ± 0.992
SerVal: 5.178 ± 1.215
SerTrp: 1.534 ± 0.492
SerTyr: 2.301 ± 0.465
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.603 ± 1.249
ThrCys: 0.959 ± 0.315
ThrAsp: 4.411 ± 0.551
ThrGlu: 1.726 ± 0.516
ThrPhe: 2.11 ± 0.52
ThrGly: 2.301 ± 0.378
ThrHis: 1.534 ± 0.599
ThrIle: 5.946 ± 0.574
ThrLys: 3.069 ± 0.387
ThrLeu: 7.096 ± 1.469
ThrMet: 1.918 ± 0.379
ThrAsn: 2.877 ± 0.905
ThrPro: 3.069 ± 0.597
ThrGln: 4.219 ± 0.673
ThrArg: 3.069 ± 0.686
ThrSer: 4.987 ± 1.561
ThrThr: 5.37 ± 1.089
ThrVal: 4.219 ± 1.023
ThrTrp: 0.575 ± 0.245
ThrTyr: 2.301 ± 0.607
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.452 ± 1.357
ValCys: 0.575 ± 0.288
ValAsp: 3.644 ± 0.336
ValGlu: 3.069 ± 0.829
ValPhe: 3.069 ± 0.255
ValGly: 4.219 ± 1.405
ValHis: 1.534 ± 0.405
ValIle: 3.836 ± 0.832
ValLys: 1.918 ± 0.328
ValLeu: 5.178 ± 0.669
ValMet: 1.918 ± 0.579
ValAsn: 2.11 ± 0.56
ValPro: 4.411 ± 0.923
ValGln: 3.452 ± 1.201
ValArg: 2.493 ± 0.809
ValSer: 4.219 ± 1.001
ValThr: 3.644 ± 0.922
ValVal: 3.26 ± 0.923
ValTrp: 0.575 ± 0.391
ValTyr: 2.301 ± 0.752
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.767 ± 0.486
TrpCys: 0.384 ± 0.348
TrpAsp: 0.575 ± 0.244
TrpGlu: 0.192 ± 0.122
TrpPhe: 0.384 ± 0.207
TrpGly: 0.959 ± 0.31
TrpHis: 0.384 ± 0.243
TrpIle: 1.534 ± 0.415
TrpLys: 0.767 ± 0.344
TrpLeu: 0.575 ± 0.29
TrpMet: 0.192 ± 0.122
TrpAsn: 0.959 ± 0.31
TrpPro: 0.959 ± 0.42
TrpGln: 0.192 ± 0.241
TrpArg: 0.384 ± 0.207
TrpSer: 0.959 ± 0.38
TrpThr: 1.151 ± 0.533
TrpVal: 1.151 ± 0.681
TrpTrp: 0.192 ± 0.234
TrpTyr: 0.192 ± 0.122
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.11 ± 0.986
TyrCys: 0.575 ± 0.263
TyrAsp: 1.726 ± 0.945
TyrGlu: 1.151 ± 0.583
TyrPhe: 1.343 ± 0.611
TyrGly: 1.534 ± 0.484
TyrHis: 0.192 ± 0.235
TyrIle: 2.685 ± 0.802
TyrLys: 1.343 ± 0.519
TyrLeu: 6.521 ± 1.634
TyrMet: 0.575 ± 0.27
TyrAsn: 1.151 ± 0.405
TyrPro: 2.493 ± 0.654
TyrGln: 2.493 ± 0.963
TyrArg: 0.767 ± 0.304
TyrSer: 2.493 ± 0.775
TyrThr: 2.301 ± 0.81
TyrVal: 1.918 ± 0.505
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.918 ± 0.581
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (5215 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
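One way such a protein-level bootstrap could work is sketched below. The exact procedure used for the database is not described here, so this is an assumption: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names and the fixed random seed are hypothetical.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, overlapping windows per protein."""
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    hits = sum(
        1
        for seq in proteins
        for i in range(len(seq) - 1)
        if seq[i:i + 2] == dipeptide
    )
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)
```

Under this scheme a dipeptide that never occurs has a frequency of 0.0 in every resample, so its error is also 0.0, which matches the `0.0 ± 0.0` entries in the table.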

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski