|1||Apr 1 and 3||-||
|2||Apr 8 and 10||
(On your own: ASR: Language Modeling)
|3||Apr 15 and 17||
|4||Apr 22 and 24||
Social Meaning Extraction
|5||Apr 29 and May 1||
Conversational Agents: Human conversation, simple frame-based dialogue systems
Conversational Agents: Grounding, Confirmation, Dialogue Acts, Evaluation
|6||May 6 and 8||
Extracting Social Meaning
Extracting Paralinguistics and Medical Informatics
|7||May 13 and 15||
Conversational Agents III: Markov Decision Processes (MDPs), etc.
Siri and Mobile Conversational Agents
|8||May 20 and 22||
Text to Speech (TTS)
Speaker Identification and Verification (and Conclusion of TTS)
|9||May 27 and 29||
Deep Neural Networks for Acoustic Modeling (Lecture by Andrew Maas)
Final Project Draft Presentations
|-||Monday June 9: 12:00 noon||-||-||
Final Project Due
Tuesday and Thursday 2:15-3:30pm. Room is currently 260-113, although it might change, so watch this space.
If you have a question that is not confidential or personal, post it on the Piazza forum; responses tend to be quicker and reach a wider audience. To contact the teaching staff directly, we strongly encourage you to come to office hours. If that is not possible, you can also email the course staff list (non-technical questions only) at firstname.lastname@example.org. We cannot reply to email sent to individual staff members. If you have a matter to be discussed privately, please come to office hours, or use email@example.com to make an appointment. For grading questions, please talk to us after class or during office hours.
We use the mailing list generated by Axess to convey messages to the class. We will assume that all students read these messages.
Since we occasionally reuse homeworks from previous years, we expect students not to copy, refer to, or look at the solutions in preparing their answers. It is an honor code violation to intentionally refer to a previous year's solutions. This applies both to the official solutions and to solutions that you or someone else may have written up in a previous year. It is also an honor code violation to find some way to look at the test set, to interfere in any way with programming assignment scoring, or to tamper with the submit script.
Introduction to spoken language technology with an emphasis on dialogue and conversational systems. Automatic speech recognition, extraction of affect and social meaning from speech, speech synthesis, dialogue management, and applications to digital assistants, search, and recommender systems.
CS 124, 221, 224N, or 229
5 homeworks. Each homework is due at 2:00pm on its due date (i.e., before class starts).
Programming Assignment Collaboration: You may talk to anybody you want about the assignments and bounce ideas off each other. But you must write the actual programs yourself.
You have 5 free late (calendar) days to use on the programming assignments. Once these are exhausted, any PA turned in late will be penalized 20% per late day. Each 24 hours or part thereof that a homework is late uses up one full late day.
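As a rough illustration of how the late-day arithmetic works out, here is a minimal Python sketch. It is not an official grading script. The function names are hypothetical, and it assumes three things the policy does not spell out: free late days are consumed in submission order, the 20% penalty is additive per penalized day (not compounding), and the penalty is capped at 100% of the assignment score.

```python
import math

FREE_LATE_DAYS = 5        # from the policy: 5 free late calendar days
PENALTY_PER_DAY = 0.20    # from the policy: 20% per late day after that


def late_days_used(hours_late: float) -> int:
    """Each 24 hours or part thereof counts as one full late day."""
    if hours_late <= 0:
        return 0
    return math.ceil(hours_late / 24)


def apply_late_policy(hours_late_per_pa):
    """Return (penalty fraction per assignment, free late days remaining).

    Assumptions (not stated in the policy): free days are used in
    submission order, and the penalty is additive and capped at 1.0.
    """
    remaining = FREE_LATE_DAYS
    penalties = []
    for hours in hours_late_per_pa:
        days = late_days_used(hours)
        free = min(days, remaining)      # cover as many days as possible for free
        remaining -= free
        penalized_days = days - free     # days beyond the free allowance
        penalties.append(min(1.0, penalized_days * PENALTY_PER_DAY))
    return penalties, remaining
```

For example, `apply_late_policy([30, 0, 72.5, 1])` treats the first assignment (30 hours late) as 2 late days and the third (72.5 hours late) as 4 late days, so the free allowance runs out partway through the third assignment.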
We will expect you to do a significant amount of textbook reading in this course.
There is no final exam for this course.
Any project in speech recognition, speech synthesis, speech understanding, dialogue design, speech user interface design, etc. Projects should be joint; 3 people is optimal; 2 is acceptable only if you have a convincing reason. The final project will be presented as a poster at the poster session on Tuesday June 3, and is due by email on Monday June 9 at noon PST.