Downes.ca ~ Stephen's Web ~ A jargon-free explanation of how AI large language models work

A jargon-free explanation of how AI large language models work

Timothy B. Lee, Ars Technica, Sept 27, 2023
Commentary by Stephen Downes

This is a 'gentle primer' describing how large language models (LLM) work. Once again, notice that the AI doesn't 'copy' content from articles, and doesn't 'plagiarize' material. It focuses on word order and associations between works, creating a graph that allows it to use a neural network (2018 neural network explainer) to predict which word will come next in a sequence of words. And it is at least arguable that this is what humans do as well; "prediction may be foundational to biological intelligence as well as artificial intelligence. In the view of philosophers like Andy Clark, the human brain can be thought of as a "prediction machine" whose primary job is to make predictions about our environment that can then be used to navigate that environment successfully."

Today: 3 Total: 795 [Direct link] [Share]

View full size