Osteoarthritis and Cartilage Open (Mar 2022)

Osteoarthritis Data Integration Portal (OsteoDIP): A web-based gene and non-coding RNA expression database

  • Chiara Pastrello,
  • Mark Abovsky,
  • Richard Lu,
  • Zuhaib Ahmed,
  • Max Kotlyar,
  • Christian Veillette,
  • Igor Jurisica

Journal volume & issue
Vol. 4, no. 1
p. 100237

Abstract

Read online

Objective: OsteoDIP aims to collect and provide, in a simple searchable format, curated high throughput RNA expression data related to osteoarthritis. Design: Datasets are collected annually by searching “osteoarthritis gene expression profile” in PubMed. Only publications containing patient data and a list of differentially expressed genes are considered. From 2020, the search has expanded to include non-coding RNAs. Moreover, a search in GEO for “osteoarthritis” datasets has been performed using ‘Homo sapiens' and ‘Expression profiling by array’ filters. Annotations for genes linked to osteoarthritis have been downloaded from external databases. Results: Out of 1204 curated papers, 63 have been included in OsteoDIP, while GEO curation led to the collection of 28 datasets. Literature data provides a snapshot of osteoarthritis research derived from 1924 human samples, while GEO datasets provide expression for additional 1012 patients. Similar to osteoarthritis literature, OsteoDIP data has been created mostly from studies focused on knee, and the tissue most frequently investigated is cartilage. GEO data sets were fully integrated with associated clinical data. We showcase examples and use cases applicable for translational research in osteoarthritis. Conclusions: OsteoDIP is publicly available at http://ophid.utoronto.ca/OsteoDIP. The website is easy to navigate and all the data is available for download. Data consolidation allows researchers to perform comparisons across studies and to combine data from different datasets. Our examples show how OsteoDIP can integrate with and improve osteoarthritis researchers’ pipelines.

Keywords