Large language models are the computer programs that are trained on vast amounts of data in order to learn how to generate text based outputs that sound as close to natural human speech. Large Language Models are typically built by layering methods and functions, each of which are optimized for different tasks in processing or generating text. These models are the basis of many AI tools including chatbots for online shopping, code generators, or automatic summarizers, but it cannot be used right out the box for these tasks. Language models are typically trained to be general purpose, so in order to transform a language model into a usable application, additional work must be done.

This course focuses on giving an introduction to the process of building a large language model. We will also take a deeper look at how one would select an appropriate existing large language model to adapt to a user's needs.