Every time you query a modern database, there is a decent chance the engineers who built it learned how databases work from one person’s YouTube channel.
Andy Pavlo runs the Database Group at Carnegie Mellon University. His lectures on advanced database systems are free. On YouTube. Hundreds of thousands of views. And they are, by most accounts, the best database education available anywhere.
What he teaches and why it matters
Database internals are the plumbing of the digital world. Every application you use, every transaction you make, every search result you see sits on top of a database. The people who build these systems need to understand query optimization, storage engines, concurrency control, indexing structures, and a dozen other topics that most computer science programs treat as electives.
Pavlo treats them as essentials. His lecture series covers advanced topics with a depth and clarity that makes paid courses look amateur. Students at CMU get this for credit. Everyone else gets it for free.
The research that changed databases
His lab produced NoisePage, a self-driving database that autonomously optimizes itself. His work on in-memory OLAP processing and vectorized query execution directly influenced DuckDB and other modern analytical databases.
He also built the CMU Database of Databases, cataloging over 900 database systems. If you have ever wondered “has someone built a database that does X?” the answer is probably in his catalog.
Why you do not know his name
Database internals researchers do not become famous. The field is too technical, too infrastructure-level, too invisible to end users. Nobody thanks the database engine when their search returns results in 50 milliseconds.
But the engineers who build those engines? Many of them trace their education back to Pavlo’s lectures. The knowledge flows from his classroom to YouTube to companies building products that millions of people depend on daily.
That pipeline, from one professor’s lecture to the databases running the internet, is invisible. And it is one of the most valuable educational contributions in modern computer science.
The open education argument
Pavlo could charge for this. Other professors do. Online database courses cost hundreds or thousands of dollars. His are free.
The result: the quality of database engineering across the entire industry is higher because one person decided that the knowledge should be accessible to anyone with an internet connection. Not just CMU students. Not just people who can afford a masters program. Anyone.
That decision, made quietly, without press releases or funding announcements, has trained more database engineers than most universities.
Andy Pavlo’s lectures are available at CMU Database Group YouTube. If you build anything that touches a database, start here.