Arithmetic progression
We shall now give some mathematical evidences that will prove that in the biochemistry of BRCA1 and RASSF1A in EOC tissues there really is programmatic and cybernetic algorithm in which it is „recorded“, in the language of mathematics, how the molecule will be built and what will be the quantitative characteristics of the given genetic information.
Primer sequences
|
|
|
| Progression of atomic numbers |
|
| |||||
5’ | G | G | T | T | A | A | T | T | T | A | G |
|
| 156 | 222 | 288 |
|
| 494 | 560 |
| 696 | 774 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1696 | 1618 | 1552 |
|
| 1346 | 1280 |
| 1148 | 1078 |
5’ | T | C | A | A | C | A | A | A | C | T | C |
|
| 124 | 194 | 264 |
|
| 462 | 532 |
| 656 | 714 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1552 | 1494 | 1424 |
|
| 1226 | 1156 |
| 1028 | 962 |
5’ | G | G | T | T | A | A | T | T | T | A | G |
|
| 156 | 222 | 288 |
|
| 494 | 560 |
| 696 | 774 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1680 | 1602 | 1536 |
|
| 1330 | 1264 |
| 1132 | 1062 |
5’ | T | C | A | A | C | G | A | A | C | T | C |
|
| 124 | 194 | 264 |
|
| 470 | 540 |
| 664 | 722 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1592 | 1534 | 1464 |
|
| 1258 | 1188 |
| 1060 | 994 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 7080 | 7080 | 7080 |
|
| 7080 | 7080 |
| 7080 | 7080 |
|
|
|
|
| Progression of atomic numbers |
|
|
| ||||||
5’ | …… | A | G | A | G | T | T | T | T | G | A | G | A | |
|
|
| 774 |
| 922 |
|
|
|
| 1264 |
| 1412 |
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
| 1078 |
| 930 |
|
|
|
| 588 |
| 440 |
| |
5’ | …… | T | C | A | C | A | C | C | A | C | A | C | A | |
|
|
| 714 |
| 842 |
|
|
|
| 1156 |
| 1284 |
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
| 962 |
| 834 |
|
|
|
| 520 |
| 392 |
| |
5’ | …… | A | G | A | G | T | T | T | C | G | A | G | A | |
|
|
| 774 |
| 922 |
|
|
|
| 1256 |
| 1404 |
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
| 1062 |
| 914 |
|
|
|
| 580 |
| 432 |
| |
5’ | …… |
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
| 5364 |
| 5364 |
|
|
|
| 5364 |
| 5364 |
| |
Figure 5. Progression of atomic numbers of BRCA1 and RASSF1A in EOC tissues
Notes: By using chemical-information procedures, we calculated the arithmetic progression for the information content of aforementioned nucleotides.
We would particularly like to stress here that the genetic, as well as biochemical information in a broader sense of the word, is determined and characterized by very complex cybernetic and information principles. The constantans in those principles are: the number of atoms and molecules, atomic numbers, atomic weight, physical and chemical parameters, even and odd values, codes and analogue codes, standard deviations, frequencies, primary and secondary values, and many other things.
DISCUSSION
The results of our research show that the processes of sequencing the molecules are conditioned and arranged not only with chemical and biochemical lawfulness, but also with program, cybernetic and informational lawfulness too. At the first stage of our research we replaced nucleotides from the Amino Acid Code Matrix with numbers of the atoms and atomic numbers in those nucleotides. Translation of the biochemical language of these amino acids into a digital language may be very useful for developing new methods of predicting protein sub-cellular localization, membrane protein type, protein structure secondary prediction or any other protein attributes. Since the concept of Chou's pseudo amino acid composition was proposed 1,2, there have been many efforts to try to use various digital numbers to represent the 20 native amino acids in order to better reflect the sequence-order effects through the vehicle of pseudo amino acid composition. Some investigators used complexity measure factor 3, some used the values derived from the cellular automata 4-7, some used hydrophobic and/or hydrophilic values 8-16, some were through Fourier transform 17,18, and some used the physicochemical distance 19. The author [34-42] is devoted to provide a digital code for each of 20 native amino acids. These digital codes should more complete and better reflect the essence of each of the 20 amino acids. Therefore, it might stimulate a series of future work by using the author’s digital codes to formulate the pseudo amino acid composition for predicting protein structure class [20-22], subcellular location [23, 24], membrane protein type [9, 25], enzyme family class [26, 27], GPCR type [28, 29], protease type [30], protein-protein interaction [31], metabolic pathways [32], protein quaternary structure [33], and other protein attributes. It is going to be possible to use a completely new strategy of research in genetics in the future. However, close observation of all these relationships, which are the outcomes of periodic laws (more specifically the law of binary coding), stereo-chemical and digital structure of proteins.
1. K.C. Chou, Gene Cloning & Expression Technologies, Chapter 4 (Weinrer, P.W.,
and Lu, Q., Eds.), Eaton Publishing, Westborough, MA (2002), pp. 57-70.
2. K.C. Chou, Prediction of protein cellular attributes using pseudo amino acid
composition PROTEINS: Structure, Function, and Genetics (Erratum: ibid., 2001,
Vol.44,60) 43 (2001) 246-255.
3. X. Xiao, S. Shao, Y. Ding, Z. Huang, Y. Huang, K. C. Chou, Using complexity
measure factor to predict protein subcellular location, Amino Acids 28 (2005) 57-
61.
4. X. Xiao, S. Shao, Y. Ding, Z. Huang, X. Chen, K. C. Chou, Using cellular automata
to generate Image representation for biological sequences, Amino Acids 28 (2005)
29-35.
5. X. Xiao, S. Shao, Y. Ding, Z. Huang, X. Chen, K. C. Chou, An Application of Gene
Comparative Image for Predicting the Effect on Replication Ratio by HBV Virus
Gene Missense Mutation, Journal of Theoretical Biology 235 (2005) 555-565.
6. X. Xiao, S. H. Shao, Z. D. Huang, K. C. Chou, Using pseudo amino acid
composition to predict protein structural classes: approached with complexity
measure factor, Journal of Computational Chemistry 27 (2006) 478-482.
7. X. Xiao, S. H. Shao, Y. S. Ding, Z. D. Huang, K. C. Chou, Using cellular automata
images and pseudo amino acid composition to predict protein sub-cellular location,
Amino Acids 30 (2006) 49-54.
8. K. C. Chou, Using amphiphilic pseudo amino acid composition to predict enzyme
subfamily classes, Bioinformatics 21 (2005) 10-19.
9. K. C. Chou, Y. D. Cai, Prediction of membrane protein types by incorporating
amphipathic effects, Journal of Chemical Information and Modeling 45 (2005) 407-
413.
10. Z. P. Feng, Prediction of the subcellular location of prokaryotic proteins based on a
new representation of the amino acid composition, Biopolymers 58 (2001) 491-499.
11. Z. P. Feng, An overview on predicting the subcellular location of a protein, In Silico
Biol 2 (2002) 291-303.
12. M. Wang, J. Yang, Z. J. Xu, K. C. Chou, SLLE for predicting membrane protein
types, Journal of Theoretical Biology 232 (2005) 7-15.
13. S. Q. Wang, J. Yang, K. C. Chou, Using stacked generalization to predict membrane
protein types based on pseudo amino acid composition, Journal of Theoretical
Biology, in press (2006) doi:10.1016/j.jtbi.2006.1005.1006.
14. M. Wang, J. Yang, G. P. Liu, Z. J. Xu, K. C. Chou, Weighted-support vector
machines for predicting membrane protein types based on pseudo amino acid
composition, Protein Engineering, Design, and Selection 17 (2004) 509-516.
15. S. W. Zhang, Q. Pan, H. C. Zhang, Z. C. Shao, J. Y. Shi, Prediction protein homo-
oligomer types by pseudo amino acid composition: Approached with an improved
feature extraction and naive Bayes feature fusion, Amino Acids 30 (2006) 461-468.
16. Y. Gao, S. H. Shao, X. Xiao, Y. S. Ding, Y. S. Huang, Z. D. Huang, K. C. Chou,
Using pseudo amino acid composition to predict protein subcellular location:
approached with Lyapunov index, Bessel function, and Chebyshev filter, Amino
Acids 28 (2005) 373-376. 17. Y. Z. Guo, M. Li, M. Lu, Z. Wen, K. Wang, G. Li, J.
Wu, Classifying G protein-coupled receptors and nuclear receptors based on protein
power spectrum from fast Fourier transform, Amino Acids 30 (2006) 397-402.
18. H. Liu, M. Wang, K. C. Chou, Low-frequency Fourier spectrum for predicting
membrane protein types, Biochem Biophys Res Commun 336 (2005) 737-739.
19. K. C. Chou, Prediction of protein subcellular locations by incorporating quasi-
sequence-order effect, Biochemical & Biophysical Research Communications 278
(2000) 477-483.
20. K. C. Chou, A novel approach to predicting protein structural classes in a (20-1)-D
amino acid composition space, Proteins: Structure, Function & Genetics 21 (1995)
319- 344.
21. K. C. Chou, C. T. Zhang, Predicting protein folding types by distance functions that
make allowances for amino acid interactions, Journal of Biological Chemistry 269
(1994) 22014-22020.
22. K. C. Chou, C. T. Zhang, Review: Prediction of protein structural classes, Critical
Reviews in Biochemistry and Molecular Biology 30 (1995) 275-349.
23. K. C. Chou, D. W. Elrod, Protein subcellular location prediction, Protein
Engineering 12 (1999) 107-118.
24. K. C. Chou, Review: Prediction of protein structural classes and subcellular
locations, Current Protein and Peptide Science 1 (2000) 171-208.
25. K. C. Chou, D. W. Elrod, Prediction of membrane protein types and subcellular
locations, PROTEINS: Structure, Function, and Genetics 34 (1999) 137-153.
26. K. C. Chou, D. W. Elrod, Prediction of enzyme family classes, Journal of Proteome
Research 2 (2003) 183-190.
27. K. C. Chou, Y. D. Cai, Predicting enzyme family class in a hybridization space,
Protein Science 13 (2004) 2857-2863.
28. K. C. Chou, D. W. Elrod, Bioinformatical analysis of G-protein-coupled receptors,
Journal of Proteome Research 1 (2002) 429-433.
29. K. C. Chou, Prediction of G-protein-coupled receptor classes, Journal of Proteome
Research 4 (2005) 1413-1418.
30. K. C. Chou, Y. D. Cai, Prediction of protease types in a hybridization space,
Biochem. Biophys. Res. Comm. 339 (2006) 1015-1020.
31. K. C. Chou, Y. D. Cai, Predicting protein-protein interactions from sequences in a
hybridization space, Journal of Proteome Research 5 (2006) 316-322.
32. K. C. Chou, Y. D. Cai, W. Z. Zhong, Predicting networking couples for metabolic
pathways of Arabidopsis, EXCLI Journal 5 (2006) 55-65.
33. K. C. Chou, Y. D. Cai, Predicting protein quaternary structure by pseudo amino acid
composition, PROTEINS: Structure, Function, and Genetics 53 (2003) 282-289.
34. Kurić L. The digital language of amino acids. Amino Acids (2007) 653-661.
35. Kurić L. The Atomic Genetic Code. J. Comput Sci Biol 2 (2009) 101-116.
36. Kurić L. Mesure complexe des caracteristiques dynamiques de series temporelles
“Journal de la Societe de statistique de Paris”- tome 127, No 2.1986.
37. Kurić L. The Insulin Bio Code - Zero Frenquencies, GJMR Vol. 10 Issue 1: 15 May
2010.
38. Kurić L. Molecular biocoding of insulin, Advances and Applications in Bioinformatics
and Chemistry, Jul. 2010.p.45 – 58.
39. Kurić L. The Insulin Bio Code – Prima sequences, GJMR Vol. 1 Issue 1: 15 June
2010.
40. Kurić L. ATOMIC HEMOGLOBIN CODE, GJMR Volume 10 Issue 2 Version 1
October 2010.
41. Kurić L. Language of Insulin Decoded:Discret code 1128, IJPBS JOURNAL,
October 2010.
42. Kurić L. Measures of Bio Insulin Frequencies, IJCSET (Volume 1. Issue 4.
December, 2010)