Lecture 06B
Read mapping
Methodology
Date: Feb 13, 2025
We'll explore the challenges of aligning millions of short reads to a reference genome and discuss various algorithms and data structures that make this process efficient. The session will focus on the Burrows-Wheeler Transform (BWT) and the FM-index, two key concepts that revolutionized read alignment by enabling fast, memory-efficient sequence searching.
Learning objectives
After today, you should have a better understanding of:
- The purpose of reference-based mapping.
- Hash-based methods for handling introns.
- Suffix arrays for efficient substring searches.
- Burrows-Wheeler Transform (BWT) string compression.
- FM-index for efficient substring searches.
- Splice-aware mapping with seed-chain-extend strategy.
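As a preview of how the suffix array, BWT, and FM-index objectives fit together, here is a minimal sketch in Python. It is not taken from the lecture slides: the function names (`suffix_array`, `bwt`, `build_fm_index`, `find`) are hypothetical, the suffix array is built by naive sorting, and the occurrence table is stored uncompressed, so it illustrates the idea rather than a memory-efficient mapper.

```python
def suffix_array(text):
    """Start positions of all suffixes, sorted lexicographically (naive)."""
    return sorted(range(len(text)), key=lambda i: text[i:])


def bwt(text, sa):
    """BWT: the character that precedes each sorted suffix (wraps at 0)."""
    return "".join(text[i - 1] for i in sa)


def build_fm_index(text):
    """Build a toy FM-index for `text` (assumed to contain no '$')."""
    text += "$"  # sentinel that sorts before A, C, G, T
    sa = suffix_array(text)
    last = bwt(text, sa)

    # c_table[ch]: number of characters in text strictly smaller than ch
    counts = {}
    for ch in text:
        counts[ch] = counts.get(ch, 0) + 1
    c_table, total = {}, 0
    for ch in sorted(counts):
        c_table[ch] = total
        total += counts[ch]

    # occ[ch][i]: occurrences of ch in last[:i]
    occ = {ch: [0] * (len(last) + 1) for ch in counts}
    for i, ch in enumerate(last):
        for key in occ:
            occ[key][i + 1] = occ[key][i] + (1 if key == ch else 0)

    return {"sa": sa, "bwt": last, "C": c_table, "occ": occ}


def find(pattern, index):
    """Backward search: return sorted start positions of `pattern`."""
    lo, hi = 0, len(index["sa"])  # half-open range of matching rows
    for ch in reversed(pattern):
        if ch not in index["C"]:
            return []
        lo = index["C"][ch] + index["occ"][ch][lo]
        hi = index["C"][ch] + index["occ"][ch][hi]
        if lo >= hi:
            return []
    return sorted(index["sa"][lo:hi])


reference = "ACGTACGA"  # made-up toy reference
index = build_fm_index(reference)
hits = find("ACG", index)
```

Each step of the backward search narrows a range of suffix-array rows using only the `C` and `occ` tables, so the number of steps depends on the read length rather than on the size of the reference.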
Supplementary material
Relevant content for today's lecture.
- Suffix trees
- Suffix arrays
- Burrows-Wheeler transform
- FM-index
Presentation
- View: slides.com/aalexmmaldonado/biosc1540-l06b
- Live link: slides.com/d/v69HoBk/live
- Download: biosc1540-l06b.pdf