Amino acid dipepetide frequency for Simian immunodeficiency virus - olc

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.912AlaAla: 1.912 ± 0.619
1.53AlaCys: 1.53 ± 0.68
1.147AlaAsp: 1.147 ± 0.407
5.354AlaGlu: 5.354 ± 1.506
1.147AlaPhe: 1.147 ± 0.538
2.294AlaGly: 2.294 ± 0.504
1.912AlaHis: 1.912 ± 1.004
3.442AlaIle: 3.442 ± 0.781
3.824AlaLys: 3.824 ± 1.073
6.501AlaLeu: 6.501 ± 1.375
1.912AlaMet: 1.912 ± 0.526
2.294AlaAsn: 2.294 ± 0.982
2.294AlaPro: 2.294 ± 0.739
2.294AlaGln: 2.294 ± 0.551
1.147AlaArg: 1.147 ± 0.207
4.207AlaSer: 4.207 ± 0.725
2.677AlaThr: 2.677 ± 0.867
4.207AlaVal: 4.207 ± 0.769
1.147AlaTrp: 1.147 ± 0.207
2.294AlaTyr: 2.294 ± 0.572
0.0AlaXaa: 0.0 ± 0.0
Cys
1.147CysAla: 1.147 ± 0.423
0.0CysCys: 0.0 ± 0.0
0.382CysAsp: 0.382 ± 0.269
1.147CysGlu: 1.147 ± 0.507
0.0CysPhe: 0.0 ± 0.0
1.53CysGly: 1.53 ± 0.568
0.382CysHis: 0.382 ± 0.32
1.912CysIle: 1.912 ± 0.526
1.912CysLys: 1.912 ± 0.526
1.53CysLeu: 1.53 ± 0.38
0.0CysMet: 0.0 ± 0.0
1.147CysAsn: 1.147 ± 0.961
0.382CysPro: 0.382 ± 0.42
1.147CysGln: 1.147 ± 0.566
3.059CysArg: 3.059 ± 1.23
0.765CysSer: 0.765 ± 0.62
3.059CysThr: 3.059 ± 1.233
1.147CysVal: 1.147 ± 0.961
0.765CysTrp: 0.765 ± 0.49
2.294CysTyr: 2.294 ± 1.041
0.0CysXaa: 0.0 ± 0.0
Asp
1.147AspAla: 1.147 ± 0.507
1.53AspCys: 1.53 ± 0.38
2.677AspAsp: 2.677 ± 0.448
2.294AspGlu: 2.294 ± 0.763
1.53AspPhe: 1.53 ± 0.862
1.147AspGly: 1.147 ± 0.807
0.382AspHis: 0.382 ± 0.453
3.059AspIle: 3.059 ± 1.425
2.677AspLys: 2.677 ± 0.851
2.677AspLeu: 2.677 ± 0.448
0.0AspMet: 0.0 ± 0.0
2.294AspAsn: 2.294 ± 1.44
2.294AspPro: 2.294 ± 0.415
1.912AspGln: 1.912 ± 0.54
2.677AspArg: 2.677 ± 0.897
1.912AspSer: 1.912 ± 0.619
3.059AspThr: 3.059 ± 0.82
0.765AspVal: 0.765 ± 0.399
2.294AspTrp: 2.294 ± 0.763
1.147AspTyr: 1.147 ± 0.717
0.0AspXaa: 0.0 ± 0.0
Glu
4.589GluAla: 4.589 ± 1.956
1.53GluCys: 1.53 ± 0.497
4.207GluAsp: 4.207 ± 0.909
8.413GluGlu: 8.413 ± 3.242
1.53GluPhe: 1.53 ± 0.538
8.031GluGly: 8.031 ± 2.404
1.53GluHis: 1.53 ± 0.669
5.736GluIle: 5.736 ± 0.587
7.266GluLys: 7.266 ± 1.285
6.501GluLeu: 6.501 ± 1.054
3.442GluMet: 3.442 ± 1.26
3.059GluAsn: 3.059 ± 0.995
3.442GluPro: 3.442 ± 0.571
3.824GluGln: 3.824 ± 0.698
2.294GluArg: 2.294 ± 0.887
2.294GluSer: 2.294 ± 0.751
3.824GluThr: 3.824 ± 0.596
3.059GluVal: 3.059 ± 0.822
2.294GluTrp: 2.294 ± 0.929
2.294GluTyr: 2.294 ± 1.044
0.0GluXaa: 0.0 ± 0.0
Phe
1.53PheAla: 1.53 ± 0.68
1.53PheCys: 1.53 ± 0.603
1.53PheAsp: 1.53 ± 0.81
1.147PheGlu: 1.147 ± 0.207
1.53PhePhe: 1.53 ± 0.38
1.912PheGly: 1.912 ± 0.576
0.0PheHis: 0.0 ± 0.0
0.765PheIle: 0.765 ± 0.446
2.677PheLys: 2.677 ± 0.462
1.912PheLeu: 1.912 ± 0.423
1.147PheMet: 1.147 ± 0.882
1.53PheAsn: 1.53 ± 1.282
0.765PhePro: 0.765 ± 0.604
2.294PheGln: 2.294 ± 0.882
1.147PheArg: 1.147 ± 0.807
2.677PheSer: 2.677 ± 0.673
0.765PheThr: 0.765 ± 0.249
1.53PheVal: 1.53 ± 0.644
1.147PheTrp: 1.147 ± 0.648
1.147PheTyr: 1.147 ± 0.507
0.0PheXaa: 0.0 ± 0.0
Gly
4.589GlyAla: 4.589 ± 1.036
1.147GlyCys: 1.147 ± 0.604
2.294GlyAsp: 2.294 ± 0.347
3.824GlyGlu: 3.824 ± 0.94
1.912GlyPhe: 1.912 ± 0.672
8.031GlyGly: 8.031 ± 2.212
1.912GlyHis: 1.912 ± 0.88
7.266GlyIle: 7.266 ± 1.61
6.501GlyLys: 6.501 ± 2.285
7.266GlyLeu: 7.266 ± 2.468
0.765GlyMet: 0.765 ± 0.636
2.294GlyAsn: 2.294 ± 0.746
3.059GlyPro: 3.059 ± 1.608
3.824GlyGln: 3.824 ± 0.232
4.971GlyArg: 4.971 ± 0.512
2.677GlySer: 2.677 ± 0.308
6.119GlyThr: 6.119 ± 1.083
7.266GlyVal: 7.266 ± 1.829
1.147GlyTrp: 1.147 ± 0.407
2.677GlyTyr: 2.677 ± 0.308
0.0GlyXaa: 0.0 ± 0.0
His
0.382HisAla: 0.382 ± 0.269
0.382HisCys: 0.382 ± 0.32
0.0HisAsp: 0.0 ± 0.0
0.382HisGlu: 0.382 ± 0.269
0.382HisPhe: 0.382 ± 0.42
1.912HisGly: 1.912 ± 0.83
0.0HisHis: 0.0 ± 0.0
1.53HisIle: 1.53 ± 0.73
1.912HisLys: 1.912 ± 0.619
3.824HisLeu: 3.824 ± 0.94
0.382HisMet: 0.382 ± 0.32
1.147HisAsn: 1.147 ± 0.648
0.765HisPro: 0.765 ± 0.383
0.382HisGln: 0.382 ± 0.269
0.765HisArg: 0.765 ± 0.636
0.0HisSer: 0.0 ± 0.0
1.912HisThr: 1.912 ± 0.496
1.53HisVal: 1.53 ± 0.84
0.765HisTrp: 0.765 ± 0.49
0.382HisTyr: 0.382 ± 0.42
0.0HisXaa: 0.0 ± 0.0
Ile
2.294IleAla: 2.294 ± 0.655
1.147IleCys: 1.147 ± 0.538
3.059IleAsp: 3.059 ± 0.702
6.119IleGlu: 6.119 ± 1.715
2.677IlePhe: 2.677 ± 1.287
4.971IleGly: 4.971 ± 0.41
0.765IleHis: 0.765 ± 0.538
2.677IleIle: 2.677 ± 1.102
3.442IleLys: 3.442 ± 1.541
3.824IleLeu: 3.824 ± 0.645
0.382IleMet: 0.382 ± 0.32
1.53IleAsn: 1.53 ± 0.872
4.589IlePro: 4.589 ± 1.543
3.059IleGln: 3.059 ± 0.822
5.354IleArg: 5.354 ± 0.805
1.912IleSer: 1.912 ± 0.777
3.442IleThr: 3.442 ± 1.522
6.501IleVal: 6.501 ± 1.201
1.53IleTrp: 1.53 ± 0.471
2.677IleTyr: 2.677 ± 1.135
0.0IleXaa: 0.0 ± 0.0
Lys
5.736LysAla: 5.736 ± 1.339
1.147LysCys: 1.147 ± 0.717
1.53LysAsp: 1.53 ± 0.644
9.56LysGlu: 9.56 ± 3.754
2.677LysPhe: 2.677 ± 1.086
8.031LysGly: 8.031 ± 1.707
1.53LysHis: 1.53 ± 0.268
5.736LysIle: 5.736 ± 1.46
4.971LysLys: 4.971 ± 1.425
7.648LysLeu: 7.648 ± 1.018
1.53LysMet: 1.53 ± 0.268
2.677LysAsn: 2.677 ± 0.787
3.442LysPro: 3.442 ± 0.622
3.059LysGln: 3.059 ± 1.233
4.207LysArg: 4.207 ± 1.234
2.677LysSer: 2.677 ± 0.462
1.912LysThr: 1.912 ± 0.397
3.442LysVal: 3.442 ± 0.402
2.294LysTrp: 2.294 ± 1.223
2.677LysTyr: 2.677 ± 0.827
0.0LysXaa: 0.0 ± 0.0
Leu
6.119LeuAla: 6.119 ± 1.154
3.824LeuCys: 3.824 ± 0.441
2.294LeuAsp: 2.294 ± 0.751
9.178LeuGlu: 9.178 ± 0.789
1.912LeuPhe: 1.912 ± 1.556
6.883LeuGly: 6.883 ± 1.819
0.0LeuHis: 0.0 ± 0.0
3.442LeuIle: 3.442 ± 0.728
7.648LeuLys: 7.648 ± 1.449
9.56LeuLeu: 9.56 ± 2.178
0.765LeuMet: 0.765 ± 0.249
3.059LeuAsn: 3.059 ± 0.808
4.971LeuPro: 4.971 ± 0.766
6.119LeuGln: 6.119 ± 1.734
5.354LeuArg: 5.354 ± 1.221
3.824LeuSer: 3.824 ± 0.96
2.677LeuThr: 2.677 ± 1.456
6.119LeuVal: 6.119 ± 0.569
1.912LeuTrp: 1.912 ± 0.731
3.824LeuTyr: 3.824 ± 1.226
0.0LeuXaa: 0.0 ± 0.0
Met
1.53MetAla: 1.53 ± 0.544
0.765MetCys: 0.765 ± 0.641
0.765MetAsp: 0.765 ± 0.383
2.294MetGlu: 2.294 ± 1.102
0.765MetPhe: 0.765 ± 0.249
1.53MetGly: 1.53 ± 0.268
0.0MetHis: 0.0 ± 0.0
0.765MetIle: 0.765 ± 0.249
0.765MetLys: 0.765 ± 0.399
1.147MetLeu: 1.147 ± 0.612
1.147MetMet: 1.147 ± 0.961
0.382MetAsn: 0.382 ± 0.32
0.765MetPro: 0.765 ± 0.62
2.677MetGln: 2.677 ± 0.556
0.765MetArg: 0.765 ± 0.641
0.765MetSer: 0.765 ± 0.519
1.53MetThr: 1.53 ± 0.268
1.147MetVal: 1.147 ± 0.566
0.382MetTrp: 0.382 ± 0.32
1.53MetTyr: 1.53 ± 0.976
0.0MetXaa: 0.0 ± 0.0
Asn
2.294AsnAla: 2.294 ± 0.884
1.912AsnCys: 1.912 ± 1.178
1.912AsnAsp: 1.912 ± 0.553
1.53AsnGlu: 1.53 ± 0.38
1.912AsnPhe: 1.912 ± 0.397
3.442AsnGly: 3.442 ± 1.27
0.382AsnHis: 0.382 ± 0.32
3.059AsnIle: 3.059 ± 1.614
3.059AsnLys: 3.059 ± 1.288
2.294AsnLeu: 2.294 ± 0.982
0.765AsnMet: 0.765 ± 0.399
3.059AsnAsn: 3.059 ± 1.23
1.912AsnPro: 1.912 ± 0.423
3.442AsnGln: 3.442 ± 1.222
2.294AsnArg: 2.294 ± 0.746
1.912AsnSer: 1.912 ± 0.777
2.677AsnThr: 2.677 ± 0.522
3.059AsnVal: 3.059 ± 0.541
1.147AsnTrp: 1.147 ± 0.507
1.147AsnTyr: 1.147 ± 0.685
0.0AsnXaa: 0.0 ± 0.0
Pro
2.294ProAla: 2.294 ± 1.259
1.147ProCys: 1.147 ± 0.507
2.294ProAsp: 2.294 ± 0.641
3.442ProGlu: 3.442 ± 0.853
1.147ProPhe: 1.147 ± 0.612
3.824ProGly: 3.824 ± 0.881
0.765ProHis: 0.765 ± 0.446
3.442ProIle: 3.442 ± 0.581
1.912ProLys: 1.912 ± 0.619
5.736ProLeu: 5.736 ± 1.821
1.147ProMet: 1.147 ± 0.648
0.0ProAsn: 0.0 ± 0.0
4.207ProPro: 4.207 ± 0.771
2.677ProGln: 2.677 ± 0.787
3.442ProArg: 3.442 ± 0.767
3.442ProSer: 3.442 ± 0.767
1.912ProThr: 1.912 ± 0.576
2.677ProVal: 2.677 ± 1.525
1.53ProTrp: 1.53 ± 0.497
1.147ProTyr: 1.147 ± 0.407
0.0ProXaa: 0.0 ± 0.0
Gln
1.912GlnAla: 1.912 ± 0.731
1.53GlnCys: 1.53 ± 0.81
1.147GlnAsp: 1.147 ± 0.507
6.501GlnGlu: 6.501 ± 0.817
0.765GlnPhe: 0.765 ± 0.279
4.971GlnGly: 4.971 ± 2.232
1.912GlnHis: 1.912 ± 0.54
3.442GlnIle: 3.442 ± 1.759
5.354GlnLys: 5.354 ± 0.807
4.207GlnLeu: 4.207 ± 0.91
1.53GlnMet: 1.53 ± 0.592
2.294GlnAsn: 2.294 ± 0.809
1.912GlnPro: 1.912 ± 1.192
5.736GlnGln: 5.736 ± 2.522
4.589GlnArg: 4.589 ± 1.535
3.059GlnSer: 3.059 ± 0.401
0.765GlnThr: 0.765 ± 0.641
3.059GlnVal: 3.059 ± 0.512
2.294GlnTrp: 2.294 ± 0.814
1.912GlnTyr: 1.912 ± 0.731
0.0GlnXaa: 0.0 ± 0.0
Arg
3.824ArgAla: 3.824 ± 1.484
1.147ArgCys: 1.147 ± 0.207
2.677ArgAsp: 2.677 ± 0.745
5.736ArgGlu: 5.736 ± 1.312
2.294ArgPhe: 2.294 ± 0.739
2.677ArgGly: 2.677 ± 1.209
2.294ArgHis: 2.294 ± 1.157
4.589ArgIle: 4.589 ± 1.452
4.589ArgLys: 4.589 ± 1.145
4.207ArgLeu: 4.207 ± 0.534
0.0ArgMet: 0.0 ± 0.0
4.207ArgAsn: 4.207 ± 1.42
1.53ArgPro: 1.53 ± 0.766
3.059ArgGln: 3.059 ± 1.261
2.677ArgArg: 2.677 ± 0.78
1.147ArgSer: 1.147 ± 0.423
2.294ArgThr: 2.294 ± 0.415
4.971ArgVal: 4.971 ± 1.258
1.53ArgTrp: 1.53 ± 0.268
1.53ArgTyr: 1.53 ± 0.527
0.0ArgXaa: 0.0 ± 0.0
Ser
3.442SerAla: 3.442 ± 1.204
0.0SerCys: 0.0 ± 0.0
4.207SerAsp: 4.207 ± 1.652
2.294SerGlu: 2.294 ± 0.551
1.147SerPhe: 1.147 ± 0.685
2.677SerGly: 2.677 ± 1.086
1.53SerHis: 1.53 ± 0.81
2.677SerIle: 2.677 ± 0.654
2.677SerLys: 2.677 ± 1.126
6.501SerLeu: 6.501 ± 0.491
0.0SerMet: 0.0 ± 0.0
1.912SerAsn: 1.912 ± 0.496
1.912SerPro: 1.912 ± 0.245
2.677SerGln: 2.677 ± 0.673
1.147SerArg: 1.147 ± 0.395
2.677SerSer: 2.677 ± 0.654
3.059SerThr: 3.059 ± 1.777
1.912SerVal: 1.912 ± 0.731
0.765SerTrp: 0.765 ± 0.62
0.382SerTyr: 0.382 ± 0.269
0.0SerXaa: 0.0 ± 0.0
Thr
3.059ThrAla: 3.059 ± 0.541
1.147ThrCys: 1.147 ± 0.612
1.53ThrAsp: 1.53 ± 1.076
4.207ThrGlu: 4.207 ± 1.358
1.912ThrPhe: 1.912 ± 0.731
6.119ThrGly: 6.119 ± 1.486
1.147ThrHis: 1.147 ± 0.648
2.294ThrIle: 2.294 ± 0.982
3.059ThrLys: 3.059 ± 0.536
4.207ThrLeu: 4.207 ± 0.611
0.382ThrMet: 0.382 ± 0.42
1.147ThrAsn: 1.147 ± 0.407
3.059ThrPro: 3.059 ± 0.856
4.589ThrGln: 4.589 ± 1.203
1.912ThrArg: 1.912 ± 0.576
1.912ThrSer: 1.912 ± 0.54
3.824ThrThr: 3.824 ± 2.251
5.736ThrVal: 5.736 ± 1.414
1.53ThrTrp: 1.53 ± 0.891
1.147ThrTyr: 1.147 ± 0.395
0.0ThrXaa: 0.0 ± 0.0
Val
2.677ValAla: 2.677 ± 1.043
1.147ValCys: 1.147 ± 0.407
3.442ValAsp: 3.442 ± 0.395
3.824ValGlu: 3.824 ± 1.035
1.147ValPhe: 1.147 ± 0.716
5.736ValGly: 5.736 ± 1.796
1.147ValHis: 1.147 ± 0.648
2.677ValIle: 2.677 ± 0.673
7.266ValLys: 7.266 ± 1.635
7.648ValLeu: 7.648 ± 0.456
1.912ValMet: 1.912 ± 0.672
4.971ValAsn: 4.971 ± 0.475
4.207ValPro: 4.207 ± 1.21
1.912ValGln: 1.912 ± 0.95
3.442ValArg: 3.442 ± 1.515
4.589ValSer: 4.589 ± 2.026
3.824ValThr: 3.824 ± 1.756
1.912ValVal: 1.912 ± 0.672
0.382ValTrp: 0.382 ± 0.32
1.912ValTyr: 1.912 ± 0.48
0.0ValXaa: 0.0 ± 0.0
Trp
1.53TrpAla: 1.53 ± 0.268
0.382TrpCys: 0.382 ± 0.269
0.0TrpAsp: 0.0 ± 0.0
1.147TrpGlu: 1.147 ± 0.807
1.147TrpPhe: 1.147 ± 0.961
2.294TrpGly: 2.294 ± 1.066
0.382TrpHis: 0.382 ± 0.453
1.53TrpIle: 1.53 ± 0.813
1.53TrpLys: 1.53 ± 0.644
1.147TrpLeu: 1.147 ± 0.507
2.294TrpMet: 2.294 ± 0.655
1.912TrpAsn: 1.912 ± 0.731
0.765TrpPro: 0.765 ± 0.538
1.912TrpGln: 1.912 ± 0.658
2.294TrpArg: 2.294 ± 0.504
0.382TrpSer: 0.382 ± 0.32
1.912TrpThr: 1.912 ± 0.83
2.294TrpVal: 2.294 ± 1.067
0.765TrpTrp: 0.765 ± 0.49
0.382TrpTyr: 0.382 ± 0.32
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.53TyrAla: 1.53 ± 0.81
0.765TyrCys: 0.765 ± 0.907
0.382TyrAsp: 0.382 ± 0.387
0.765TyrGlu: 0.765 ± 0.249
1.147TyrPhe: 1.147 ± 0.648
1.53TyrGly: 1.53 ± 0.497
0.765TyrHis: 0.765 ± 0.49
1.912TyrIle: 1.912 ± 1.116
3.442TyrLys: 3.442 ± 1.066
1.147TyrLeu: 1.147 ± 0.685
1.53TyrMet: 1.53 ± 0.497
2.294TyrAsn: 2.294 ± 0.347
1.912TyrPro: 1.912 ± 0.397
2.294TyrGln: 2.294 ± 0.269
3.442TyrArg: 3.442 ± 1.502
0.765TyrSer: 0.765 ± 0.383
2.677TyrThr: 2.677 ± 1.043
3.824TyrVal: 3.824 ± 0.856
0.382TyrTrp: 0.382 ± 0.453
0.765TyrTyr: 0.765 ± 0.249
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2616 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski