Amino acid dipepetide frequency for Yunnan orbivirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.992AlaAla: 3.992 ± 0.834
1.118AlaCys: 1.118 ± 0.201
1.757AlaAsp: 1.757 ± 0.66
4.312AlaGlu: 4.312 ± 1.328
2.395AlaPhe: 2.395 ± 0.643
3.194AlaGly: 3.194 ± 0.804
0.798AlaHis: 0.798 ± 0.293
3.513AlaIle: 3.513 ± 0.99
4.152AlaLys: 4.152 ± 0.871
9.741AlaLeu: 9.741 ± 0.856
1.916AlaMet: 1.916 ± 0.753
2.555AlaAsn: 2.555 ± 0.716
2.715AlaPro: 2.715 ± 0.739
2.076AlaGln: 2.076 ± 0.798
3.992AlaArg: 3.992 ± 0.867
5.11AlaSer: 5.11 ± 0.832
4.471AlaThr: 4.471 ± 1.288
3.673AlaVal: 3.673 ± 0.62
0.958AlaTrp: 0.958 ± 0.362
1.757AlaTyr: 1.757 ± 0.45
0.0AlaXaa: 0.0 ± 0.0
Cys
0.958CysAla: 0.958 ± 0.397
0.16CysCys: 0.16 ± 0.169
0.798CysAsp: 0.798 ± 0.415
0.479CysGlu: 0.479 ± 0.211
0.798CysPhe: 0.798 ± 0.355
0.319CysGly: 0.319 ± 0.207
0.0CysHis: 0.0 ± 0.0
0.798CysIle: 0.798 ± 0.281
0.798CysLys: 0.798 ± 0.41
1.597CysLeu: 1.597 ± 0.542
0.319CysMet: 0.319 ± 0.19
0.958CysAsn: 0.958 ± 0.381
0.0CysPro: 0.0 ± 0.0
0.798CysGln: 0.798 ± 0.309
0.479CysArg: 0.479 ± 0.219
0.798CysSer: 0.798 ± 0.201
0.958CysThr: 0.958 ± 0.39
0.479CysVal: 0.479 ± 0.241
0.16CysTrp: 0.16 ± 0.129
1.278CysTyr: 1.278 ± 0.597
0.0CysXaa: 0.0 ± 0.0
Asp
3.354AspAla: 3.354 ± 0.602
0.639AspCys: 0.639 ± 0.294
3.513AspAsp: 3.513 ± 0.884
4.791AspGlu: 4.791 ± 1.01
1.757AspPhe: 1.757 ± 0.514
2.236AspGly: 2.236 ± 0.493
0.798AspHis: 0.798 ± 0.405
3.354AspIle: 3.354 ± 1.014
3.513AspLys: 3.513 ± 0.709
6.547AspLeu: 6.547 ± 0.772
1.278AspMet: 1.278 ± 0.442
0.798AspAsn: 0.798 ± 0.28
3.194AspPro: 3.194 ± 0.526
2.076AspGln: 2.076 ± 0.483
3.194AspArg: 3.194 ± 0.889
4.631AspSer: 4.631 ± 0.72
3.833AspThr: 3.833 ± 0.677
5.11AspVal: 5.11 ± 0.667
0.639AspTrp: 0.639 ± 0.489
1.437AspTyr: 1.437 ± 0.458
0.0AspXaa: 0.0 ± 0.0
Glu
5.11GluAla: 5.11 ± 0.895
0.639GluCys: 0.639 ± 0.255
3.194GluAsp: 3.194 ± 0.562
5.43GluGlu: 5.43 ± 1.045
1.916GluPhe: 1.916 ± 0.477
2.555GluGly: 2.555 ± 0.481
1.597GluHis: 1.597 ± 0.547
3.992GluIle: 3.992 ± 0.652
4.95GluLys: 4.95 ± 1.022
3.513GluLeu: 3.513 ± 0.992
2.076GluMet: 2.076 ± 0.568
2.874GluAsn: 2.874 ± 0.917
2.874GluPro: 2.874 ± 0.602
1.916GluGln: 1.916 ± 0.718
3.513GluArg: 3.513 ± 0.57
4.631GluSer: 4.631 ± 0.825
5.909GluThr: 5.909 ± 0.83
4.471GluVal: 4.471 ± 0.676
0.479GluTrp: 0.479 ± 0.27
1.916GluTyr: 1.916 ± 0.769
0.0GluXaa: 0.0 ± 0.0
Phe
1.916PheAla: 1.916 ± 0.622
0.958PheCys: 0.958 ± 0.545
2.236PheAsp: 2.236 ± 0.499
3.673PheGlu: 3.673 ± 1.022
0.958PhePhe: 0.958 ± 0.401
3.034PheGly: 3.034 ± 0.58
1.118PheHis: 1.118 ± 0.355
2.236PheIle: 2.236 ± 0.456
2.395PheLys: 2.395 ± 0.56
4.471PheLeu: 4.471 ± 0.714
0.479PheMet: 0.479 ± 0.234
1.757PheAsn: 1.757 ± 0.545
1.118PhePro: 1.118 ± 0.206
1.597PheGln: 1.597 ± 0.439
2.236PheArg: 2.236 ± 0.805
2.874PheSer: 2.874 ± 0.798
2.076PheThr: 2.076 ± 0.369
3.194PheVal: 3.194 ± 0.496
0.16PheTrp: 0.16 ± 0.129
1.437PheTyr: 1.437 ± 0.455
0.0PheXaa: 0.0 ± 0.0
Gly
2.715GlyAla: 2.715 ± 0.911
0.479GlyCys: 0.479 ± 0.28
2.715GlyAsp: 2.715 ± 1.0
2.715GlyGlu: 2.715 ± 0.454
2.555GlyPhe: 2.555 ± 0.591
3.034GlyGly: 3.034 ± 0.681
1.118GlyHis: 1.118 ± 0.369
3.992GlyIle: 3.992 ± 0.686
4.312GlyLys: 4.312 ± 1.133
2.874GlyLeu: 2.874 ± 0.899
1.757GlyMet: 1.757 ± 0.698
1.757GlyAsn: 1.757 ± 0.652
3.034GlyPro: 3.034 ± 0.682
2.555GlyGln: 2.555 ± 0.703
2.874GlyArg: 2.874 ± 0.558
3.833GlySer: 3.833 ± 0.572
3.513GlyThr: 3.513 ± 0.562
2.715GlyVal: 2.715 ± 0.677
0.319GlyTrp: 0.319 ± 0.235
2.874GlyTyr: 2.874 ± 0.482
0.0GlyXaa: 0.0 ± 0.0
His
1.597HisAla: 1.597 ± 0.446
0.16HisCys: 0.16 ± 0.144
1.437HisAsp: 1.437 ± 0.439
0.958HisGlu: 0.958 ± 0.478
1.437HisPhe: 1.437 ± 0.467
1.757HisGly: 1.757 ± 0.61
0.479HisHis: 0.479 ± 0.241
1.757HisIle: 1.757 ± 0.491
1.278HisLys: 1.278 ± 0.463
2.395HisLeu: 2.395 ± 0.666
0.319HisMet: 0.319 ± 0.207
0.798HisAsn: 0.798 ± 0.3
1.118HisPro: 1.118 ± 0.391
0.798HisGln: 0.798 ± 0.362
1.437HisArg: 1.437 ± 0.393
1.597HisSer: 1.597 ± 0.353
1.118HisThr: 1.118 ± 0.531
1.916HisVal: 1.916 ± 0.433
0.319HisTrp: 0.319 ± 0.19
0.479HisTyr: 0.479 ± 0.196
0.0HisXaa: 0.0 ± 0.0
Ile
2.555IleAla: 2.555 ± 0.582
0.798IleCys: 0.798 ± 0.301
3.833IleAsp: 3.833 ± 0.506
3.513IleGlu: 3.513 ± 0.747
3.354IlePhe: 3.354 ± 0.669
3.992IleGly: 3.992 ± 0.612
1.916IleHis: 1.916 ± 0.561
2.874IleIle: 2.874 ± 0.711
3.673IleLys: 3.673 ± 0.727
5.11IleLeu: 5.11 ± 0.921
1.597IleMet: 1.597 ± 0.642
2.076IleAsn: 2.076 ± 0.83
2.076IlePro: 2.076 ± 0.456
2.555IleGln: 2.555 ± 0.56
3.992IleArg: 3.992 ± 0.807
4.471IleSer: 4.471 ± 0.31
5.43IleThr: 5.43 ± 0.661
4.95IleVal: 4.95 ± 0.668
0.798IleTrp: 0.798 ± 0.603
2.715IleTyr: 2.715 ± 0.639
0.0IleXaa: 0.0 ± 0.0
Lys
4.631LysAla: 4.631 ± 0.976
1.437LysCys: 1.437 ± 0.588
3.034LysAsp: 3.034 ± 0.581
3.833LysGlu: 3.833 ± 0.68
2.076LysPhe: 2.076 ± 0.537
3.194LysGly: 3.194 ± 0.572
1.278LysHis: 1.278 ± 0.515
5.27LysIle: 5.27 ± 1.135
3.833LysLys: 3.833 ± 0.711
6.388LysLeu: 6.388 ± 1.034
2.395LysMet: 2.395 ± 0.485
3.833LysAsn: 3.833 ± 0.791
1.916LysPro: 1.916 ± 0.495
1.916LysGln: 1.916 ± 0.469
5.43LysArg: 5.43 ± 1.161
2.236LysSer: 2.236 ± 0.604
3.194LysThr: 3.194 ± 0.974
3.194LysVal: 3.194 ± 0.878
0.479LysTrp: 0.479 ± 0.238
1.437LysTyr: 1.437 ± 0.522
0.0LysXaa: 0.0 ± 0.0
Leu
6.388LeuAla: 6.388 ± 0.834
1.118LeuCys: 1.118 ± 0.454
5.909LeuAsp: 5.909 ± 0.839
4.152LeuGlu: 4.152 ± 0.596
3.354LeuPhe: 3.354 ± 0.844
4.791LeuGly: 4.791 ± 0.816
2.076LeuHis: 2.076 ± 0.652
4.631LeuIle: 4.631 ± 1.162
5.27LeuLys: 5.27 ± 0.731
10.061LeuLeu: 10.061 ± 1.54
3.194LeuMet: 3.194 ± 0.462
4.631LeuAsn: 4.631 ± 1.026
5.909LeuPro: 5.909 ± 0.75
5.27LeuGln: 5.27 ± 0.39
7.027LeuArg: 7.027 ± 1.844
6.707LeuSer: 6.707 ± 0.587
6.547LeuThr: 6.547 ± 0.938
6.388LeuVal: 6.388 ± 1.064
0.798LeuTrp: 0.798 ± 0.473
2.715LeuTyr: 2.715 ± 0.901
0.0LeuXaa: 0.0 ± 0.0
Met
3.513MetAla: 3.513 ± 1.037
0.798MetCys: 0.798 ± 0.287
2.076MetAsp: 2.076 ± 0.682
2.076MetGlu: 2.076 ± 0.427
0.639MetPhe: 0.639 ± 0.27
0.798MetGly: 0.798 ± 0.243
0.798MetHis: 0.798 ± 0.345
1.597MetIle: 1.597 ± 0.597
1.597MetLys: 1.597 ± 0.678
3.354MetLeu: 3.354 ± 0.847
1.916MetMet: 1.916 ± 0.411
1.916MetAsn: 1.916 ± 0.442
1.278MetPro: 1.278 ± 0.423
1.278MetGln: 1.278 ± 0.498
1.757MetArg: 1.757 ± 0.374
2.874MetSer: 2.874 ± 0.555
0.958MetThr: 0.958 ± 0.542
0.479MetVal: 0.479 ± 0.227
0.319MetTrp: 0.319 ± 0.19
0.798MetTyr: 0.798 ± 0.249
0.0MetXaa: 0.0 ± 0.0
Asn
4.152AsnAla: 4.152 ± 0.714
0.319AsnCys: 0.319 ± 0.216
2.236AsnAsp: 2.236 ± 0.77
2.555AsnGlu: 2.555 ± 0.605
2.555AsnPhe: 2.555 ± 0.361
1.757AsnGly: 1.757 ± 0.406
0.798AsnHis: 0.798 ± 0.316
2.555AsnIle: 2.555 ± 0.587
1.916AsnLys: 1.916 ± 0.538
4.791AsnLeu: 4.791 ± 0.843
0.639AsnMet: 0.639 ± 0.298
1.757AsnAsn: 1.757 ± 0.516
1.916AsnPro: 1.916 ± 0.641
1.916AsnGln: 1.916 ± 0.643
2.236AsnArg: 2.236 ± 0.569
1.757AsnSer: 1.757 ± 0.442
2.076AsnThr: 2.076 ± 0.567
3.513AsnVal: 3.513 ± 0.551
0.0AsnTrp: 0.0 ± 0.0
2.395AsnTyr: 2.395 ± 0.352
0.0AsnXaa: 0.0 ± 0.0
Pro
2.715ProAla: 2.715 ± 0.653
0.639ProCys: 0.639 ± 0.337
2.236ProAsp: 2.236 ± 0.609
3.034ProGlu: 3.034 ± 0.544
1.437ProPhe: 1.437 ± 0.414
2.236ProGly: 2.236 ± 0.601
1.597ProHis: 1.597 ± 0.459
3.673ProIle: 3.673 ± 0.649
1.916ProLys: 1.916 ± 0.491
2.874ProLeu: 2.874 ± 0.734
1.278ProMet: 1.278 ± 0.484
2.395ProAsn: 2.395 ± 0.888
2.395ProPro: 2.395 ± 0.586
1.916ProGln: 1.916 ± 0.5
2.395ProArg: 2.395 ± 0.749
1.757ProSer: 1.757 ± 0.645
3.194ProThr: 3.194 ± 0.473
2.874ProVal: 2.874 ± 0.698
0.639ProTrp: 0.639 ± 0.266
2.076ProTyr: 2.076 ± 0.488
0.0ProXaa: 0.0 ± 0.0
Gln
2.395GlnAla: 2.395 ± 0.707
0.479GlnCys: 0.479 ± 0.224
1.916GlnAsp: 1.916 ± 0.485
2.874GlnGlu: 2.874 ± 0.738
1.278GlnPhe: 1.278 ± 0.349
1.916GlnGly: 1.916 ± 0.433
0.958GlnHis: 0.958 ± 0.197
3.194GlnIle: 3.194 ± 0.778
3.513GlnLys: 3.513 ± 0.763
3.513GlnLeu: 3.513 ± 0.626
1.437GlnMet: 1.437 ± 0.488
2.555GlnAsn: 2.555 ± 0.603
1.916GlnPro: 1.916 ± 0.447
2.236GlnGln: 2.236 ± 0.52
1.757GlnArg: 1.757 ± 0.456
2.395GlnSer: 2.395 ± 0.562
2.874GlnThr: 2.874 ± 0.444
2.555GlnVal: 2.555 ± 0.721
0.319GlnTrp: 0.319 ± 0.196
1.278GlnTyr: 1.278 ± 0.408
0.0GlnXaa: 0.0 ± 0.0
Arg
4.791ArgAla: 4.791 ± 1.068
0.639ArgCys: 0.639 ± 0.33
4.152ArgAsp: 4.152 ± 0.635
4.471ArgGlu: 4.471 ± 1.08
2.555ArgPhe: 2.555 ± 0.454
4.631ArgGly: 4.631 ± 0.701
1.118ArgHis: 1.118 ± 0.404
3.513ArgIle: 3.513 ± 0.415
3.194ArgLys: 3.194 ± 0.678
7.027ArgLeu: 7.027 ± 0.952
1.597ArgMet: 1.597 ± 0.53
1.597ArgAsn: 1.597 ± 0.454
1.597ArgPro: 1.597 ± 0.555
2.555ArgGln: 2.555 ± 0.369
3.992ArgArg: 3.992 ± 0.672
4.152ArgSer: 4.152 ± 0.816
3.673ArgThr: 3.673 ± 0.373
4.95ArgVal: 4.95 ± 1.092
0.479ArgTrp: 0.479 ± 0.242
1.118ArgTyr: 1.118 ± 0.281
0.0ArgXaa: 0.0 ± 0.0
Ser
5.11SerAla: 5.11 ± 1.336
0.16SerCys: 0.16 ± 0.155
5.27SerAsp: 5.27 ± 0.669
3.992SerGlu: 3.992 ± 0.532
2.555SerPhe: 2.555 ± 0.598
3.673SerGly: 3.673 ± 0.805
2.236SerHis: 2.236 ± 0.526
3.833SerIle: 3.833 ± 0.821
4.312SerLys: 4.312 ± 1.114
5.11SerLeu: 5.11 ± 0.811
3.833SerMet: 3.833 ± 0.711
2.076SerAsn: 2.076 ± 0.244
3.034SerPro: 3.034 ± 0.61
3.194SerGln: 3.194 ± 0.551
4.312SerArg: 4.312 ± 1.15
4.631SerSer: 4.631 ± 0.556
4.791SerThr: 4.791 ± 0.677
3.513SerVal: 3.513 ± 0.705
0.798SerTrp: 0.798 ± 0.472
3.194SerTyr: 3.194 ± 1.03
0.0SerXaa: 0.0 ± 0.0
Thr
4.152ThrAla: 4.152 ± 0.64
0.319ThrCys: 0.319 ± 0.256
2.874ThrAsp: 2.874 ± 1.084
4.631ThrGlu: 4.631 ± 0.993
3.034ThrPhe: 3.034 ± 0.558
3.833ThrGly: 3.833 ± 0.922
1.916ThrHis: 1.916 ± 0.549
4.791ThrIle: 4.791 ± 0.97
3.194ThrLys: 3.194 ± 0.743
6.547ThrLeu: 6.547 ± 1.097
1.118ThrMet: 1.118 ± 0.507
2.874ThrAsn: 2.874 ± 0.618
2.555ThrPro: 2.555 ± 0.468
3.034ThrGln: 3.034 ± 0.454
4.95ThrArg: 4.95 ± 1.066
5.909ThrSer: 5.909 ± 0.766
4.471ThrThr: 4.471 ± 1.283
3.992ThrVal: 3.992 ± 0.843
0.798ThrTrp: 0.798 ± 0.223
2.236ThrTyr: 2.236 ± 0.437
0.0ThrXaa: 0.0 ± 0.0
Val
2.236ValAla: 2.236 ± 0.445
1.118ValCys: 1.118 ± 0.355
4.631ValAsp: 4.631 ± 0.614
4.471ValGlu: 4.471 ± 0.789
3.034ValPhe: 3.034 ± 0.52
2.715ValGly: 2.715 ± 0.88
1.916ValHis: 1.916 ± 0.354
4.152ValIle: 4.152 ± 0.631
3.513ValLys: 3.513 ± 0.554
5.749ValLeu: 5.749 ± 0.374
2.555ValMet: 2.555 ± 0.345
2.076ValAsn: 2.076 ± 0.818
2.715ValPro: 2.715 ± 0.617
2.395ValGln: 2.395 ± 0.549
3.992ValArg: 3.992 ± 0.773
5.11ValSer: 5.11 ± 0.908
5.43ValThr: 5.43 ± 0.903
3.833ValVal: 3.833 ± 1.093
0.958ValTrp: 0.958 ± 0.353
2.874ValTyr: 2.874 ± 0.519
0.0ValXaa: 0.0 ± 0.0
Trp
0.639TrpAla: 0.639 ± 0.343
0.16TrpCys: 0.16 ± 0.148
0.639TrpAsp: 0.639 ± 0.282
0.479TrpGlu: 0.479 ± 0.225
1.118TrpPhe: 1.118 ± 0.352
0.479TrpGly: 0.479 ± 0.316
0.16TrpHis: 0.16 ± 0.169
0.798TrpIle: 0.798 ± 0.44
1.118TrpLys: 1.118 ± 0.375
0.958TrpLeu: 0.958 ± 0.464
0.0TrpMet: 0.0 ± 0.0
0.958TrpAsn: 0.958 ± 0.519
0.16TrpPro: 0.16 ± 0.155
0.479TrpGln: 0.479 ± 0.254
0.479TrpArg: 0.479 ± 0.201
0.16TrpSer: 0.16 ± 0.169
0.479TrpThr: 0.479 ± 0.201
0.319TrpVal: 0.319 ± 0.147
0.319TrpTrp: 0.319 ± 0.19
0.479TrpTyr: 0.479 ± 0.331
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.278TyrAla: 1.278 ± 0.328
0.798TyrCys: 0.798 ± 0.327
2.236TyrAsp: 2.236 ± 0.423
1.118TyrGlu: 1.118 ± 0.36
1.118TyrPhe: 1.118 ± 0.223
1.597TyrGly: 1.597 ± 0.615
0.479TyrHis: 0.479 ± 0.196
1.757TyrIle: 1.757 ± 0.652
2.555TyrLys: 2.555 ± 0.303
4.471TyrLeu: 4.471 ± 1.255
0.958TyrMet: 0.958 ± 0.334
1.597TyrAsn: 1.597 ± 0.564
1.757TyrPro: 1.757 ± 0.461
0.798TyrGln: 0.798 ± 0.271
1.757TyrArg: 1.757 ± 0.498
3.992TyrSer: 3.992 ± 0.695
2.236TyrThr: 2.236 ± 0.643
3.354TyrVal: 3.354 ± 0.43
0.639TyrTrp: 0.639 ± 0.322
1.278TyrTyr: 1.278 ± 0.225
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (6263 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski