Amino acid dipeptide frequency for Hubei narna-like virus 19

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
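As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs in one-letter protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the toy sequences, and the choice to count pairs only within each protein (never across protein boundaries) are assumptions made for this example, not necessarily how Proteome-pI computes its table.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy input: pairs are MA, AA, AL from the first
# sequence and AA, AK from the second.
freqs = dipeptide_frequencies(["MAAL", "AAK"])
```

Here `freqs["AA"]` is 2 of 5 pairs, so the frequencies of all observed dipeptides sum to 1000‰.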

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
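The uniform expectation and the over/under labelling can be sketched as follows. The helper names and the strict greater-than/less-than comparison are assumptions; the page does not state the exact threshold (or whether 400 or 441 pairs are used) behind its red and blue marking.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Frequency each dipeptide would have if all pairs were equally likely."""
    return 1000.0 / (alphabet_size ** 2)

def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"    # would correspond to red
    if observed_per_mille < expected_per_mille:
        return "under"   # would correspond to blue
    return "as expected"

expected = expected_uniform_per_mille()      # 2.5 per mille, i.e. 0.25%
label_leu_leu = classify(9.466, expected)    # LeuLeu value from the table
label_ala_cys = classify(0.676, expected)    # AlaCys value from the table
```

With 21 symbols (including Xaa), `expected_uniform_per_mille(21)` gives 1000/441, roughly 2.27‰.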

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows: first amino acid of the dipeptide; columns: second amino acid. Each cell gives frequency ± error (‰).

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 5.409 ± 0.971 | 0.676 ± 0.48 | 3.381 ± 0.468 | 6.761 ± 0.019 | 0.676 ± 0.476 | 6.761 ± 1.931 | 4.057 ± 0.012 | 4.057 ± 2.879 | 5.409 ± 0.971 | 7.437 ± 0.499 | 1.352 ± 0.004 | 2.028 ± 0.472 | 1.352 ± 0.004 | 1.352 ± 0.96 | 6.761 ± 1.931 | 2.705 ± 0.008 | 3.381 ± 1.444 | 4.057 ± 0.012 | 0.0 ± 0.0 | 0.676 ± 0.48 | 0.0 ± 0.0
Cys | 0.676 ± 0.48 | 0.676 ± 0.476 | 2.028 ± 0.472 | 1.352 ± 0.952 | 0.0 ± 0.0 | 0.676 ± 0.48 | 2.028 ± 0.472 | 2.028 ± 1.428 | 0.676 ± 0.476 | 2.028 ± 0.472 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.705 ± 1.904 | 0.676 ± 0.476 | 1.352 ± 0.952 | 2.028 ± 0.472 | 1.352 ± 0.004 | 1.352 ± 0.96 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asp | 2.028 ± 0.484 | 0.0 ± 0.0 | 4.733 ± 0.492 | 2.028 ± 1.428 | 2.028 ± 1.44 | 4.733 ± 0.464 | 0.0 ± 0.0 | 2.705 ± 0.948 | 1.352 ± 0.004 | 4.733 ± 2.403 | 0.676 ± 0.48 | 1.352 ± 0.004 | 4.057 ± 0.944 | 0.676 ± 0.476 | 6.085 ± 0.495 | 2.705 ± 1.904 | 1.352 ± 0.004 | 4.733 ± 2.403 | 0.0 ± 0.0 | 0.676 ± 0.48 | 0.0 ± 0.0
Glu | 3.381 ± 1.424 | 0.0 ± 0.0 | 1.352 ± 0.952 | 4.057 ± 0.944 | 0.676 ± 0.48 | 3.381 ± 0.488 | 0.0 ± 0.0 | 1.352 ± 0.952 | 2.028 ± 0.484 | 6.085 ± 1.416 | 0.676 ± 0.476 | 1.352 ± 0.004 | 4.057 ± 0.944 | 0.676 ± 0.48 | 3.381 ± 1.424 | 2.028 ± 0.484 | 4.733 ± 1.42 | 5.409 ± 1.896 | 0.676 ± 0.48 | 1.352 ± 0.004 | 0.0 ± 0.0
Phe | 2.028 ± 1.44 | 1.352 ± 0.004 | 1.352 ± 0.96 | 2.705 ± 0.008 | 0.676 ± 0.476 | 1.352 ± 0.004 | 1.352 ± 0.952 | 1.352 ± 0.004 | 1.352 ± 0.004 | 4.057 ± 0.944 | 0.0 ± 0.0 | 0.676 ± 0.48 | 2.028 ± 0.484 | 0.676 ± 0.48 | 5.409 ± 0.94 | 4.733 ± 2.403 | 0.0 ± 0.0 | 0.676 ± 0.48 | 0.0 ± 0.0 | 0.676 ± 0.48 | 0.0 ± 0.0
Gly | 5.409 ± 0.94 | 0.0 ± 0.0 | 4.057 ± 1.923 | 3.381 ± 2.38 | 2.028 ± 0.484 | 4.057 ± 1.923 | 4.057 ± 0.968 | 2.705 ± 0.964 | 3.381 ± 1.424 | 8.114 ± 0.023 | 0.0 ± 0.338 | 2.028 ± 1.428 | 4.057 ± 2.856 | 2.705 ± 0.948 | 6.761 ± 0.975 | 6.085 ± 1.416 | 5.409 ± 0.016 | 6.085 ± 0.495 | 0.676 ± 0.48 | 2.705 ± 0.008 | 0.0 ± 0.0
His | 2.705 ± 0.008 | 2.028 ± 1.428 | 1.352 ± 0.952 | 2.028 ± 0.472 | 0.0 ± 0.0 | 1.352 ± 0.96 | 2.705 ± 0.008 | 2.028 ± 0.484 | 0.676 ± 0.476 | 2.705 ± 0.008 | 1.352 ± 0.004 | 2.028 ± 0.472 | 0.676 ± 0.476 | 0.676 ± 0.48 | 4.057 ± 0.012 | 0.676 ± 0.48 | 1.352 ± 0.952 | 4.733 ± 1.42 | 1.352 ± 0.004 | 0.0 ± 0.0 | 0.0 ± 0.0
Ile | 4.057 ± 0.012 | 0.676 ± 0.48 | 0.676 ± 0.476 | 1.352 ± 0.952 | 1.352 ± 0.96 | 4.057 ± 0.012 | 2.028 ± 0.472 | 3.381 ± 0.488 | 0.676 ± 0.48 | 1.352 ± 0.96 | 0.676 ± 0.48 | 3.381 ± 0.468 | 4.733 ± 0.492 | 2.028 ± 1.428 | 8.79 ± 0.503 | 5.409 ± 0.94 | 2.705 ± 1.904 | 2.705 ± 0.964 | 1.352 ± 0.004 | 1.352 ± 0.96 | 0.0 ± 0.0
Lys | 6.761 ± 4.799 | 1.352 ± 0.004 | 1.352 ± 0.004 | 2.028 ± 0.484 | 2.705 ± 1.92 | 4.057 ± 0.012 | 1.352 ± 0.952 | 0.676 ± 0.48 | 2.705 ± 0.964 | 3.381 ± 0.468 | 0.0 ± 0.0 | 0.676 ± 0.476 | 0.676 ± 0.48 | 1.352 ± 0.96 | 1.352 ± 0.004 | 4.057 ± 0.012 | 0.676 ± 0.48 | 3.381 ± 1.424 | 0.676 ± 0.48 | 0.676 ± 0.476 | 0.0 ± 0.0
Leu | 8.114 ± 0.023 | 2.705 ± 1.904 | 3.381 ± 0.488 | 3.381 ± 0.468 | 1.352 ± 0.004 | 8.114 ± 1.889 | 4.057 ± 1.9 | 3.381 ± 0.468 | 2.028 ± 0.484 | 9.466 ± 2.841 | 2.028 ± 0.472 | 2.705 ± 0.008 | 5.409 ± 2.883 | 2.028 ± 0.472 | 6.761 ± 2.848 | 6.085 ± 2.407 | 4.733 ± 1.447 | 4.057 ± 0.968 | 1.352 ± 0.004 | 1.352 ± 0.952 | 0.0 ± 0.0
Met | 0.676 ± 0.48 | 0.0 ± 0.0 | 0.676 ± 0.48 | 1.352 ± 0.96 | 0.676 ± 0.48 | 0.676 ± 0.48 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.676 ± 0.48 | 0.0 ± 0.0 | 1.352 ± 0.952 | 0.676 ± 0.476 | 2.028 ± 0.472 | 0.676 ± 0.48 | 0.676 ± 0.476 | 2.028 ± 1.428 | 3.381 ± 1.444 | 0.676 ± 0.48 | 2.028 ± 0.472 | 0.0 ± 0.0 | 0.0 ± 0.0
Asn | 3.381 ± 0.488 | 0.0 ± 0.0 | 1.352 ± 0.952 | 0.676 ± 0.48 | 1.352 ± 0.952 | 1.352 ± 0.004 | 1.352 ± 0.004 | 4.733 ± 0.492 | 0.676 ± 0.48 | 0.676 ± 0.476 | 2.028 ± 0.472 | 0.676 ± 0.476 | 2.028 ± 0.472 | 0.676 ± 0.476 | 3.381 ± 0.468 | 4.733 ± 0.492 | 1.352 ± 0.952 | 4.057 ± 0.944 | 0.676 ± 0.476 | 0.676 ± 0.476 | 0.0 ± 0.0
Pro | 2.028 ± 0.484 | 0.676 ± 0.48 | 4.057 ± 0.968 | 2.705 ± 0.008 | 2.028 ± 0.472 | 3.381 ± 0.468 | 4.057 ± 0.968 | 2.705 ± 0.948 | 2.028 ± 0.484 | 6.085 ± 1.416 | 0.676 ± 0.476 | 3.381 ± 1.424 | 2.705 ± 0.964 | 0.676 ± 0.48 | 8.79 ± 1.409 | 4.057 ± 0.012 | 2.705 ± 0.008 | 4.733 ± 0.492 | 2.028 ± 0.484 | 2.028 ± 0.484 | 0.0 ± 0.0
Gln | 0.676 ± 0.48 | 0.676 ± 0.476 | 0.676 ± 0.48 | 1.352 ± 0.96 | 1.352 ± 0.004 | 4.057 ± 1.9 | 0.0 ± 0.0 | 1.352 ± 0.004 | 0.676 ± 0.48 | 4.733 ± 0.464 | 0.676 ± 0.48 | 0.0 ± 0.0 | 3.381 ± 0.468 | 0.676 ± 0.48 | 2.028 ± 0.484 | 0.676 ± 0.48 | 1.352 ± 0.004 | 0.676 ± 0.48 | 2.705 ± 1.92 | 0.676 ± 0.476 | 0.0 ± 0.0
Arg | 4.733 ± 1.447 | 2.705 ± 0.008 | 5.409 ± 0.94 | 4.733 ± 2.376 | 4.057 ± 0.944 | 3.381 ± 0.488 | 4.057 ± 1.9 | 4.733 ± 1.42 | 4.733 ± 0.464 | 4.733 ± 1.42 | 0.676 ± 0.48 | 3.381 ± 0.468 | 8.114 ± 0.933 | 4.733 ± 1.447 | 4.057 ± 0.968 | 9.466 ± 1.885 | 6.761 ± 0.975 | 6.085 ± 1.416 | 2.028 ± 0.484 | 5.409 ± 1.927 | 0.0 ± 0.0
Ser | 5.409 ± 0.016 | 3.381 ± 2.38 | 4.733 ± 1.447 | 1.352 ± 0.952 | 5.409 ± 0.94 | 6.761 ± 3.804 | 2.028 ± 0.484 | 4.057 ± 0.012 | 4.057 ± 0.968 | 5.409 ± 0.94 | 2.028 ± 0.484 | 5.409 ± 1.927 | 0.0 ± 0.0 | 2.028 ± 0.484 | 5.409 ± 1.927 | 8.79 ± 2.365 | 4.057 ± 0.968 | 5.409 ± 0.016 | 2.028 ± 0.484 | 2.028 ± 0.484 | 0.0 ± 0.0
Thr | 4.057 ± 0.944 | 1.352 ± 0.952 | 2.705 ± 1.92 | 2.028 ± 0.484 | 2.028 ± 0.484 | 6.085 ± 0.495 | 0.676 ± 0.476 | 4.057 ± 0.012 | 0.676 ± 0.48 | 4.733 ± 1.447 | 0.0 ± 0.0 | 2.028 ± 0.472 | 4.057 ± 0.968 | 2.705 ± 0.008 | 4.733 ± 1.42 | 5.409 ± 0.94 | 3.381 ± 1.444 | 5.409 ± 0.016 | 0.676 ± 0.476 | 2.028 ± 0.484 | 0.0 ± 0.0
Val | 1.352 ± 0.004 | 2.705 ± 0.948 | 1.352 ± 0.004 | 2.705 ± 0.948 | 1.352 ± 0.004 | 7.437 ± 0.457 | 0.0 ± 0.0 | 2.705 ± 0.008 | 4.733 ± 2.403 | 4.057 ± 0.012 | 2.028 ± 1.44 | 2.705 ± 0.008 | 6.761 ± 1.931 | 0.676 ± 0.48 | 10.142 ± 4.273 | 5.409 ± 1.927 | 8.114 ± 0.933 | 7.437 ± 0.499 | 1.352 ± 0.004 | 2.028 ± 0.484 | 0.0 ± 0.0
Trp | 4.057 ± 2.879 | 0.0 ± 0.0 | 2.028 ± 1.428 | 0.676 ± 0.476 | 0.676 ± 0.48 | 0.0 ± 0.0 | 0.676 ± 0.48 | 0.676 ± 0.48 | 0.676 ± 0.48 | 2.028 ± 0.484 | 0.676 ± 0.337 | 0.676 ± 0.476 | 0.676 ± 0.48 | 1.352 ± 0.952 | 2.705 ± 0.008 | 1.352 ± 0.96 | 0.676 ± 0.48 | 1.352 ± 0.96 | 0.676 ± 0.476 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 1.352 ± 0.004 | 0.676 ± 0.48 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.028 ± 1.44 | 2.705 ± 0.948 | 0.0 ± 0.0 | 4.057 ± 0.012 | 0.676 ± 0.48 | 0.676 ± 0.476 | 0.676 ± 0.48 | 0.676 ± 0.476 | 2.028 ± 0.484 | 1.352 ± 0.96 | 1.352 ± 0.96 | 1.352 ± 0.004 | 1.352 ± 0.004 | 2.028 ± 0.472 | 1.352 ± 0.96 | 3.381 ± 0.468 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 2 proteins (1480 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
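A minimal sketch of what protein-level bootstrapping could look like is given below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. The function names, the toy sequences, and the use of the population standard deviation as the error measure are assumptions; the page does not specify the exact procedure.

```python
import random
import statistics
from collections import Counter

def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the protein count.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)

toy_proteome = ["MAAALLK", "MKLLAAR"]  # hypothetical sequences
err = bootstrap_error(toy_proteome, "AA")
```

With only 2 proteins, as in this proteome, resampling with replacement can produce just three distinct protein sets, so each replicate estimate takes one of at most three values.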

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski