Week 1: Intro to the course, omics data, and OSC
Overview
Welcome to the AU25 course Practical Computing Skills for Omics Data! In this first week of the course, you will get an overview of the course’s contents and introductions to omics data and the Ohio Supercomputer Center (OSC).
Learning goals & lectures
In each of the following lectures, you will learn:
Course intro (Tuesday class)
- What to expect from this course
- Which tools and languages we will use
- What is expected of you during this course
Omics data (Tuesday/Thursday class)
- What omics data is
- What the main high-throughput sequencing technologies are
- How Illumina sequence libraries are structured
- What reference genomes are, and what they are good for
- What the main sequence file types are and what FASTA, FASTQ, and GFF/GTF files look like
The Ohio Supercomputer Center (OSC) (Thursday class)
- What a supercomputer is, and why they are useful
- What resources the Ohio Supercomputer Center (OSC) provides
- How to access OSC resources through its OnDemand webportal
Readings
Poinsignon et al. (2023): Working with omics data: An interdisciplinary challenge at the crossroads of biology and computer science
This book chapter provides an overview of different kinds of omics data.Markowetz (2015): Five selfish reasons to work reproducibly
This short paper gives a light-hearted overview of self-interested reasons to perform your research reproducibly.“Overview of Illumina Sequencing by Synthesis Workflow”
Watch this 5-minute video to learn how Illumina sequencing works. This is meant to provide a bit more background, given that the example data in this course will be from Illumina sequencing. This is a very quick overview and you don’t need to understand all the details! (Optionally, also watch the video overviews of Oxford Nanopore and Pacific Biosciences sequencing technology that are embedded in the next two slides.)
Assignments & exercises
- Ungraded assignment: Pre-course survey Deadline: Tue Aug 26, 10 am
- Ungraded assignment: Ohio Supercomputer Center account setup Deadline: Wed Aug 27, 5 pm
Further resources
Throughout the course, chapters from these two books are optional readings for this course. Both can be accessed online for free with OSU credentials via the links below.
Buffalo (2015): Bioinformatics Data Skills (Buffalo 2015)
This week: Preface and Chapter 1 – How to Learn BioinformaticsAllesina (2019): Computing Skills for Biologists (Allesina & Wilmes 2019)
This week: Chapter 0 – Building a Computing Toolbox (Introduction & Section 0.1 only)
Browse through the below paper titles and see if any of these spark your interest — many of these present omics and HTS data in a more field-specific light, e.g. plant pathology.
- Lee (2023): The principles and applications of high-throughput sequencing technologies (link)
- Dijk et al. (2023): Genomics in the long-read sequencing era (link)
- Heyden et al. (2025): Advancing species conservation and management through omics tools (link)
- Nizamani et al. (2023): High-throughput sequencing in plant disease management: a comprehensive review of benefits, challenges, and future perspectives (link)
- Aragona et al. (2022): New-generation sequencing technology in diagnosis of fungal plant pathogens: A dream comes true? (link)
- Mahmood et al. (2022): Multi-omics revolution to promote plant breeding efficiency (link)
- Rai, Yamazaki, and Saito (2019): A new era in plant functional genomics (link)
- Pinto and Bhatt (2024): Sequencing-based analysis of microbiomes (link)
- Manzoni et al. (2018): Genome, transcriptome and proteome: the rise of omics data and their integration in biomedical sciences (link)