Knowledge-driven Natural Language Generation
- Time: Wednesday and Friday 2pm-3:15pm
- Place: 1131 Sibel
- Instructor: Prof. Heng Ji (Email:
hengji@illinois.edu; Office: 3318 Sibel; Office hours: Wednesday 3:15pm-4:15pm)
- TA: Ying Lin (Email:yinglin8@illinois.edu; Office: Siebel 1115)
Lectures and Schedules
Course Description
In this course we will teach machines to describe knowledge they have learned from data. We will develop a set of
intelligent systems which can transform structured knowledge bases into natural language, which is an opposite
direction
of Information Extraction. The topics will cover both of the conventional template filling based approaches and
modern
neural networks based generation approaches. We will dive deep into various technical components: how to
represent
knowledge, how to feed knowledge into a generation model, how to evaluate generation results? We will do three
project-based assignments and a final term project interesting applications including:
- News image and video caption generation to describe entities and events
- Generate scripts for news videos
- Generate scientific ideas and write technical papers
- Write technical reviews from various perspectives
- Write a bio for all kinds of professionals, artists and paintings
- Write a news article about scientific discovery results and perspectives about events
- Write a history book a news timeline based on event-centric knowledge bases
- Make Alexa smarter by feeding information from background and real-time news streams
- Or the other way around: Generate paintings, news videos and yoga instructional videos
This is an advanced graduate-level course, and the prerequisites include Natural Language Processing and Machine
Learning course.
Grading
- The instructor gives an overview tutorial (2pm-2:50pm), followed by Q/A and discussions (2:50pm-3:15pm).
- 3 Assignments (15pts each): participate in shared tasks and submit through Codalab, the grades will be based
on
your system’s rank in the class. If a submission is successfully complete then it will get 10 basic score,
and
then the additional score is based on the system's relative rank in the class between [0, 5]
- Pick one projet from five applications or one theory problem, final Paper submitted to
EMNLP2020/CIKM2020/AACL2020/COLING2020 (55pts): you can either continue working on one of the assignments
and
focus on novel methods, or choose a new topic to work on. we will host open-house poster session and select
a
paper for best paper.
Course Materials
The instructor will write most of the slides and hand-outs, give survey about the best papers from
The following books may provide some useful background:
Prerequisites:
- Programming in Python
- Already took NLP and machine learning courses
- Familiar with Probability and Statistics
- Research background in NLP or related fields; it's targeted for students who work on related areas.
Extensive research background is expected.
- Interests in languages and lingusitics
- Solid background in algorithms, probabilities
- Good programming skills
- Sufficient mathematical background
Class Policy
- Restricted Academic Integrity
- NO Incompleteness are accepted
- NO Late Assignments are accepted
- NO cell phones and internet surfing are allowed in the classroom
- Don't come late to the class, if you are late more than 10 minutes, simply skip it in order not to disturb
the
lecture
Mailing List
- nlg-2020@lists.cs.illinois.edu