Professor: Daniel Seita (Office: GCS 305A)
Use Brightspace to contact the course staff.
Thursdays: 4:00 pm to 7:20 pm. Location: TBD.
We will have an extended 25-minute break approximately halfway through each session to give everyone time to rest, use the restroom, grab a snack, etc.
We will record lectures and share them on this website (see the "Recording" column below).
Daniel: Thursdays 11:00 am to 12:00 pm (GCS 305A), except for Thursdays when there is no class or when I am traveling. In those cases, I will offer alternative times, but send me an email if you want to meet at other times.
This course covers state-of-the-art advances in robot manipulation, focusing on how robots physically interact with and affect their environments through actions such as grasping, pushing, pick-and-place, tool use, and dynamic contact behaviors. Due to recent advances in robot learning, robot perception, and robot hardware, robot manipulation has experienced tremendous interest and growth in recent years. Despite this progress, real-world manipulation remains fundamentally hard, and progress can be tricky to measure due to lack of standardized benchmarks and cherry-picked demonstrations.
Students will study core principles of manipulation (including perception, control, planning, and dynamics) with modern data-driven approaches such as imitation learning, reinforcement learning, diffusion-based policy generation, and multimodal (e.g., vision-language-tactile) reasoning. While assignments will use simulators for faster and more reproducible work, this class also discusses how to get such systems to work more reliably in unstructured real-world settings, so that students develop an appreciation of the challenges of building such systems.
The class mostly consists of instructor-led lectures but will have some student-led discussions of recent research papers. There will also be a substantial final project. This class is aimed at PhD students doing research in this topic; advanced undergraduates and master's students are welcome to enroll with permission of the instructor.
This course is intended to:
At the end of this course, students will be able to:
No formal prerequisites. Recommended preparation: familiarity with robotics at the level of CSCI 445L or CSCI 545, and familiarity with machine learning at the level of CSCI 467. Concurrent enrollment with these courses can be helpful. Students should be comfortable programming in Python and working in a Linux/Ubuntu shell environment.
Primary textbooks (both freely available online):
Secondary textbooks (also open source):
Additional readings (research papers, code, etc.) will be provided as needed and will all be open-source and free to access.
| Homework 1 | 20% |
| Homework 2 | 20% |
| In-Class (Written) Midterm | 15% |
| Participation | 5% |
| Class Presentation | 10% |
| Final Project | 30% |
| Total | 100% |
We will use Brightspace for communication and grading. We do not use a fixed grading scale (e.g., 90–100 = A); grades are computed at the end of the semester. Missing a homework assignment will reduce the maximum achievable grade by one letter. Students will be allowed a small number of late days for the two homework assignments; late days cannot be used for the final project.
In lieu of a final exam, students will work on a substantial final project and may work in groups of 1–2 (with expectations scaling for teams of two). The final project grade is subdivided into: project presentation (1/5), final written report (1/5), and quality of results (3/5). The final project report should ideally be of sufficient quality to form the basis of a future submission to a top-tier robotics conference.
TBD.
Syllabus (Subject to Change)
| Date | Week | Topic | Assignment | Readings / References | Slides | Recording |
|---|---|---|---|---|---|---|
| Thurs Aug 27 |
01 | Course introduction; review of basic robotics concepts; examples of recent manipulation research | Sign up for paper presentation dates | MR, Ch 2 and 3.1 | ||
| Thurs Sep 03 |
02 | Forward and inverse kinematics; pick-and-place methods | HW 1 (released) | MR, Ch 4 (excl. 4.1) and 6.1; RT, Ch 3 |
||
| Thurs Sep 10 |
03 | Computer vision and 3D geometric perception for robot manipulation | RT, Ch 4 | |||
| Thurs Sep 17 |
04 | Robot simulators; sim-to-real transfer | HW 1 due | RT, Ch 2 (excl. control sections); NVIDIA survey paper |
||
| Thurs Sep 24 |
05 | Motion planning: sampling and optimization | HW 2 (released) | MR, Ch 9.2 and 10 (up to 10.5); RT, Ch 6 (excl. graphs of convex sets) |
||
| Thurs Oct 01 |
06 | Imitation learning | Project Proposal due | UR, Ch 21; IL survey |
||
| Thurs Oct 08 |
No Class (Fall Recess) | |||||
| Thurs Oct 15 |
07 | Reinforcement learning | HW 2 due | RT, Ch 11; SB, Ch 3–4 |
||
| Thurs Oct 22 |
08 | Model-based and model-free dynamics; robot control (manipulator, position, and force) | MR, Ch 8.1 and 11; RT, Ch 8.1–8.3 |
|||
| Thurs Oct 29 |
09 | Midterm Exam (in class, about 1 hour) | ||||
| Thurs Nov 05 |
10 | Robot hardware: tactile sensors, dexterous hands, and humanoids | RT, Ch 2.3–2.4 and 12; Tactile manipulation survey |
|||
| Thurs Nov 12 |
11 | Robot foundation models; vision-language-action (VLA) models | Project Milestone due | RT, Ch 9.4; UR, Ch 2; FM survey; arxiv 2402.05741; arxiv 2312.07843 |
||
| Thurs Nov 19 |
12 | Student presentations; student-selected topic | TBA | |||
| Thurs Nov 26 |
No Class (Thanksgiving) | |||||
| Thurs Dec 03 |
13 | Student presentations (cont.); course summary and remarks; Final project presentations |
Final Report due Sun Dec 06 |
TBA |