Content-type: text/html ~ Stephen's Web ~ A jargon-free explanation of how AI large language models work

Stephen Downes

Knowledge, Learning, Community

This is a 'gentle primer' describing how large language models (LLM) work. Once again, notice that the AI doesn't 'copy' content from articles, and doesn't 'plagiarize' material. It focuses on word order and associations between works, creating a graph that allows it to use a neural network (2018 neural network explainer) to predict which word will come next in a sequence of words. And it is at least arguable that this is what humans do as well; "prediction may be foundational to biological intelligence as well as artificial intelligence. In the view of philosophers like Andy Clark, the human brain can be thought of as a "prediction machine" whose primary job is to make predictions about our environment that can then be used to navigate that environment successfully."

Today: 2 Total: 1485 [Direct link] [Share]

Stephen Downes Stephen Downes, Casselman, Canada

Copyright 2024
Last Updated: May 22, 2024 2:19 p.m.

Canadian Flag Creative Commons License.