Applying bioinformatics to solve biological problems: this is the objective of the University of Malaga research group "BI4NEXT", which, in one of its latest studies, carried out at the Supercomputing and Bioinnovation Center (SCBI) using biobank samples, has identified new biomarkers for the diagnosis, prognosis and even treatment of lung cancer.
The discovery, published in the scientific journal PeerJ, shows that in both tumour cells and healthy cells the repetitive DNA regions, mainly formed by transposon fossils, are expressed consistently and differentially, in a controlled way.
"Since 2010, we have worked on the assumption that repetitive elements in healthy cells were dormant, and that when the cell becomes cancerous, these regions become deregulated, are expressed wildly and cause resistance to treatment", explains Professor Gonzalo Claros of the UMA Department of Molecular Biology and Biochemistry. He asserts that his research indicates this is not the case: repetitive sequences are specifically expressed and regulated in both normal and tumour tissue. "What we did show is that this control changes when a normal cell becomes cancerous", he says.
Consequently, this UMA research group has identified some repetitive regions that behave similarly in all patients and in all types of lung cancer studied. This is the case of the AluYg6 and LTR18B elements, which are repressed in all lung cancer cells, and of the HERVK11D-Int and UCON88 elements, which are activated specifically in adenocarcinoma and small-cell lung carcinoma, respectively.
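As an illustration only, the idea of an element that "behaves similarly in all patients" can be sketched in a few lines of Python. The sketch assumes paired measurements (healthy and tumour tissue from the same patient) and a simple log2 fold-change threshold. The expression values, the pseudocount, the threshold and the function names are all hypothetical; this is not the pipeline used in the study.

```python
import math

# Hypothetical normalized expression values (arbitrary units) for ONE
# repetitive element in healthy and tumour tissue of the same patient.
# Illustrative numbers only, not data from the study.
paired_expression = {
    "patient_1": {"healthy": 120.0, "tumour": 30.0},
    "patient_2": {"healthy": 95.0, "tumour": 40.0},
    "patient_3": {"healthy": 150.0, "tumour": 60.0},
}


def paired_log2_fold_changes(samples, pseudocount=1.0):
    """Return log2(tumour / healthy) per patient.

    The pseudocount (an assumption of this sketch) avoids division by zero
    when an element is not detected in one of the two tissues.
    """
    return {
        patient: math.log2(
            (values["tumour"] + pseudocount) / (values["healthy"] + pseudocount)
        )
        for patient, values in samples.items()
    }


def consistent_direction(fold_changes, min_abs_log2fc=1.0):
    """Return 'repressed' or 'activated' if every patient agrees, else None.

    An element counts as consistent only when the change in ALL patients
    exceeds the (hypothetical) threshold in the same direction.
    """
    values = list(fold_changes.values())
    if not values:
        return None
    if all(v <= -min_abs_log2fc for v in values):
        return "repressed"
    if all(v >= min_abs_log2fc for v in values):
        return "activated"
    return None


fold_changes = paired_log2_fold_changes(paired_expression)
direction = consistent_direction(fold_changes)
print(direction)
```

With these toy values the element is lower in tumour tissue for every patient, so it is reported as repressed. An element that goes up in some patients and down in others would return `None` and would not be considered a candidate biomarker under this simplified rule.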
This bioinformatic study is still pending validation in the laboratory, but it nevertheless brings the confirmatory diagnosis of lung cancer one step closer and offers specialists a new source of information, one that could be extended to all diseases with a genetic component.
New source of biomarkers
The researchers of "BI4NEXT" propose studying the repetitive elements, which make up more than 50 per cent of the genome, as a new and still barely exploited source of biomarkers. They highlight the accuracy and the cost and time savings that would derive from analysing these expression markers, in addition to genome mutations, using high-throughput sequencing.
Translating this into clinical application is the challenge facing this group of scientists, who propose, as a potential solution, using liquid biopsy to find the repetitive transposon sequences that are consistently expressed, as explained above. "Diagnostic robustness would be increased and, moreover, it would take less time and less sample", points out Claros.
For this study, the researchers spent almost 3 years comparing healthy tissue and tumour tissue from the same lung cancer patient, an analysis that is normally performed on tissues from different patients. The samples came from 50 patients from Korea and 17 patients from China, drawn from public databases, later joined by 8 patients from the Regional Hospital of Malaga through the biobank.
"Just by sequencing the eight patients from the biobank of Malaga, using the high-throughput sequencing platform of the SCBI and the Picasso supercomputer, we obtained the data we had verified in a thousand other patients from other databases", says the Professor of Molecular Biology and Biochemistry, who also indicates that all this information is available to the scientific community.
Macarena Arroyo, pulmonologist at the Regional Hospital of Malaga, is the lead author of this study, which she started in 2013 as her PhD thesis under the supervision of Professor Gonzalo Claros, together with Manuel Cobo, oncologist, and Rocío Bautista, PhD in Biology and bioinformatics specialist at the UMA. Today, this work is the starting point for new lines of research, such as the scientific article recently published in PeerJ.
Arroyo M, Bautista R, Larrosa R, Cobo MÁ, Claros MG. 2019. Biomarker potential of repetitive-element transcriptome in lung cancer. PeerJ 7:e8277 https:/