The Bioinformatics group is a team of professional scientists focusing on study, application, development, and optimization of tools for the analysis of genomic and biological data generated by INGM scientists and collaborators. The facility works closely with the Institute’s researchers and gives access to biological data analyses and elaborations to all research groups. It supports basic and translational research with both standard and customized analyses; it provides and facilitates access to up-to-date as well as novel analytical methods.
Our team works in the Institute’s bioinformatics open space in strict contact with other bioinformatics researchers and graduate students. Our team members are either dedicated to single projects or work in collaboration on multiple ones, according to Institute’s needs, workloads and requirements of principal investigators. Our background is wide and covers biology, systems biology, computer science, biostatistics; our multidisciplinar nature allows us to have a fresh and systemic view of data and their biomedical and clinical context.
The IT personnel grants us the access to INGM’s in-house state-of-the-art high performance computing infrastructure and connectivity.
- Support on experimental design for data-intensive projects, data cleaning and data exploration.
- Medium to high throughput gene expression profiling: from RTqPCR arrays to microarrays.
- Next generation sequencing analyses: RNA sequencing, whole exome sequencing, custom panels, ChIP sequencing.
- Analysis of non-coding RNAs data, cellular and circulating microRNAs.
- Multivariate analyses for transcriptomics, genomics and proteomics; features selection, biomarker prioritization, descriptive and inferential biostatistics.
- Functional analyses for biological contextualization, gene ontology, methods for pathway analyses.
- Advanced functional analyses based on pathways impact, network metrics.
- Design and development of software applications for computational biology and genomics.
INGM bioinformaticians rely on a in-house high performance computing (HPC) cluster with more than 300 CPUs, 1.5 TB RAM and about 100 TB of disk storage. The infrastructure was deployed and is being maintained by th Information Technology personnel in collaboration with the bioinformatics group. The whole infrastructure is wired with high speed connectivity and protected by secure and backup systems.
Computational activities are performed on the HPC and managed by the Torque/PBS queue system on a series of virtual machines. Both the HCP and VMs run Ubuntu Linux operating system. Minor computational tasks can be also performed locally on personal workstations (Xeon PCs, Windows OS) and/or laptops (Win/iOS), which are also used as HPC clients.
Applications / Software
- CombiROC (http://www.combiroc.eu)
CombiROC is a web application for guided and interactive generation of multimarker panels.
- myVCF (http://myvcf.readthedocs.io/en/latest/)
myVCF is a application for high-throughput mutations data management managing multiple sequencing projects created from VCF files; it allows end-users without strong programming and bioinformatics skills to explore, query, visualize and export mutations data in a simple and straightforward way.
miRiadne is a tool for re-annotating miRNA namelists or datasets. Obsolete annotations (either due to older miRBase versions or out-dated profiling platforms) can be converted into newer ones enforcing mature sequence correspondence. This project is not further mantained and the application is not available anymore: for any enquire please contact the paper’s main author (see below).
- Computation and Selection of Optimal Biomarker Combinations by Integrative ROC Analysis Using CombiROC
Bombaci M., Rossi RL.
In: Brun V., Couté Y. (eds) Proteomics for Biomarker Discovery. Methods in Molecular Biology, vol 1959. Humana Press, New York, NY. (2019)
- Big Data: Challenge and Opportunity for Translational and Industrial Research
Rossi RL, Grifantini RM.
Front. Digit. Humanit. 5:13 (2018)
- myVCF: a desktop application for high-throughput mutations data management
Pietrelli A, Valenti L.
Bioinformatics btx475 (2017)
- CombiROC: an interactive web tool for selecting accurate marker combinations of omics data
Mazzara S, Rossi RL, Grifantini R, Donizetti S, Abrignani S, Bombaci M.
Sci Rep (2017) 7:45477
- Normalization of circulating microRNA expression data obtained by quantitative real-time RT-PCR
Marabita F, de Candia P, Torri A, Tegnér J, Abrignani S, Rossi RL.
Brief Bioinform (2016) 17:204-12
- miRiadne: a web tool for consistent integration of miRNA nomenclature.
Bonnal RJ., Rossi RL., Carpi D., Ranzani V., Abrignani S., Pagani M.
Nucleic Acids Res (2015) 43:W487-92