## Harvard Stat 221 “Statistical Computing and Visualization”: all lectures online

Stat 221 is Statistical Computing and Visualization. It’s a graduate class on analyzing data without losing scientific rigor, and communicating your work. Topics span the full cycle of a data-driven project including project setup, design, implementation, and creating interactive user experiences to communicate ideas and results. We covered current theory and philosophy of building models for data, computational methods, and tools such as d3js, parallel computing with MPI, R.

All lecture slides are now available online:

- Lecture 1, Course Introduction
- Lecture 2, Introduction to Visualization, Modeling, and Computing (VMC)
- Lecture 3, Intro VMC – Modeling and Computing
- Lecture 4 – Guest Lecture by Rachel Schutt, Introduction to Data Science
- Lecture 5, A More Rigorous Look at Visualization
- Lecture 6, Statistical Models and Likelihood
- Lecture 7, Likelihood Principle, MLE Foundations, Odyssey
- Lecture 8, Stochastic Optimization for Inference, Odyssey
- Lecture 9, Modeling with Missing Data/Latent Variables
- Lecture 10, Expectation-Maximization Algorithm (EM)
- Lecture 11, EM for HMMs, Properties of EM
- Lecture 12, EM variants, Data Augmentation
- Lecture 13, Likelihood + Prior = Posterior (Bayesian Inference)
- Lecture 14, Missing Data and MCMC
- Lecture 15, Hamiltonian Monte Carlo (HMC)
- Lecture 16, Decision Theory and Statistical Inference
- Lecture 17, Parallel Statistical Computing
- Lecture 18, Parallel Tempering
- Lecture 19, Message Passing Interface (MPI) for Parallel Tempering
- Lecture 20, Equi-Energy MCMC Sampler
- Lecture 21, Approximate Methods: Variational Inference
- Lecture 22, Variational EM, Monte Carlo EM
- Lecture 23, Hacker Level: Data Augmentation
- Lecture 24, Interactive Experiences and Us
- Lecture 25, The Final Lecture: Summing It Up

I feel privileged to have been invited onto this journey together with the students. Together, we learned substantial theory, created interactive visualization, defined open problems in current research, structured our thinking about interactive user experiences, and are now finishing up working on course final projects with a roster of first-class course partners.

While the lectures are over, the journey of learning and new discoveries in data-driven projects doesn’t stop here. If anything, it’s only getting more interesting.

Hello,

I’m researching about statistics and design, and I came to this blogpost. I really liked it, but I wanted more information on this subject, specially the use of D3.js. Do you have the actual course recorded for iTunes U or any other online studying platform? ( Coursera, Edx, etc.?)

Thank you very much,

Cláudio

Glad you like the material!

Unfortunately, lecture videos are currently only available for residential students.

Claudio, vc é brasileiro?

Eu tbm cheguei a esse site pq trabalho na area de estistica computacional e desenvolvimento web. Tenho alguns projetos desenvolvidos com d3. Essa biblioteca é muito boa porém mal documentada. Quando vc começa a entender a logica do d3 verá que é muito potente. Infelizmente acho que ainda é muito cedo para encontrarmos algo bem detalhado. De qualquer modo te sugiro o site do Mike http://bost.ocks.org/mike/ . Tem o site do proprio d3 http://d3js.org/ . Tem um site de introducao ao d3 http://www.janwillemtulp.com/2011/03/20/tutorial-introduction-to-d3/

http://natescodevault.com/js/line/

http://www.dashingd3js.com/svg-paths-and-d3js

https://github.com/mbostock/d3/wiki/ .

Mas foi aqui que eu comecei a estudar d3. http://alignedleft.com/tutorials/d3/

abrcs

Claudio, I’ve been following this channel on YouTube, very helpful!

http://bit.ly/d3js-tuts

Boa sorte!

Marcelo

Sergiy,

Thank you so much for publishing these lectures!

Best,

Marcelo

You’re welcome — let me know if you have any suggestions on how to improve the material!

Pingback:Harvard Data Science | Data Science 101Pingback:Posterior samples | Sam CliffordGood looking stuff, I would like to download the presentations so I don’t have to be connected to the net to read them (and also so I can refer to them later). Are you purposefully preventing that or am I just not seeing how to do it?

Glad you like the material!

I don’t think there is an easy way to preserve interactive components such as this one

http://nesterko.com/lectures/stat221-2012/lecture5/#/14

when you save the presentations to your computer in a way that doesn’t let you connect to the Internet.

What you can do is save lecture slides to PDF by adding ?print-pdf to the end of their URLs like so

http://nesterko.com/lectures/stat221-2012/lecture5/?print-pdf

Then you can print the slides to PDF with Command-P. If you want to save paper, you can select landscape paper and 4 pages per sheet to get a more standard printed lecture slides format.

Hope this helps!

Very much so, thanks. I’ve been blown away by the interactive stuff, I wasn’t expecting that.

Pingback:Data Viz News [7] | Visual Loophello i like it very much.

Pingback:Data Science from Harvard | AloofnℇssPingback:Data Science from Harvard | datagraphy !