Deep learning and Natural Language Processing
Natural language processing (NLP) is one of the most important technologies of the information age. Understanding complex language utterances is also a crucial part of artificial intelligence. Applications of NLP are everywhere because people communicate most everything in language: web search, advertisement, emails, customer service, language translation, radiology reports, etc. There are a large variety of underlying tasks and machine learning models powering NLP applications. Recently, deep learning approaches have obtained very high performance across many different NLP tasks.
These models can often be trained with a single end-to-end model and do not require traditional, task-specific feature engineering. In this spring quarter course students will learn to implement, train, debug, visualize and invent their own neural network models. The course provides a deep excursion into cutting-edge research in deep learning applied to NLP. The final project will involve training a complex recurrent neural network and applying it to a large scale NLP problem. On the model side we will cover word vector representations, window-based neural networks, recurrent neural networks, long-short-term-memory models, recursive neural networks, convolutional neural networks as well as some very novel models involving a memory component.
Through lectures and programming assignments students will learn the necessary engineering tricks for making neural networks work on practical problems.
- Proficiency in Python
- College Calculus, Linear Algebra
You should be comfortable taking derivatives and understanding matrix vector operations and notation.
- Basic Probability and Statistics
You should know basics of probabilities, gaussian distributions, mean, standard deviation, etc.
- Equivalent knowledge of Machine Learning
We will be formulating cost functions, taking derivatives and performing optimization with gradient descent.