Generative AI

How It Works

Introductions and Some Technical Stuff

Few of us possess an actual understanding of how modern technology works at a schematic or mechanical level. Rather, while we never quite grasp how it works, we become comfortable once we know that it works.

Calculators. Computers. Elevators. Cars. Airplanes. Dishwashers. ATMs. Microwaves. TV. Internet. Mobile. Cloud. Email. Cameras. For most of us, our knowledge of how the technology underpinning our daily lives functions is de minimis.

The new and surprising can be unsettling, especially if a breakthrough seems to presage some form of major disruption. It is natural to be curious about what is going on under the hood. While this is not the place to turn if you want to deeply understand parameters or loss functions, we have ourselves felt this compulsion to develop a sounder understanding of GenAI. What follows are resources we’ve consulted to raise our baseline comprehension in order to better identify signal in the noise.

For us, a necessary, but not sufficient, point of departure is that the underlying models are probabilistic rather than deterministic. That is, when working with only the raw models—as was the case with the original ChatGPT—outputs are based on probabilities, not any source of truth.
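
To make the distinction concrete, here is a minimal sketch, with invented answers and weights, contrasting a deterministic lookup against probabilistic sampling:

```python
import random

# Deterministic: the same question always returns the same answer,
# retrieved from a source of truth.
facts = {"first person on the Moon": "Neil Armstrong"}
print(facts["first person on the Moon"])  # always "Neil Armstrong"

# Probabilistic: the answer is sampled from a distribution of likely
# continuations. The candidates and weights below are invented for
# illustration; nothing here consults a source of truth.
candidates = ["Neil Armstrong", "Buzz Aldrin", "an astronaut"]
weights = [0.95, 0.03, 0.02]
print(random.choices(candidates, weights=weights)[0])  # usually, not always, "Neil Armstrong"
```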

A large language model, for example, is a model of language based on a large amount of data. This enables text prediction on steroids. As Stephen Wolfram opens his ur-explainer What is ChatGPT Doing…and Why Does It Work?, “It’s just adding one word at a time.”
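
As a toy illustration of that one-word-at-a-time loop, consider the sketch below. The next-word table and its weights are invented for illustration; a real model derives billions of such statistics from enormous volumes of text:

```python
import random

# A toy "model": for a given current word, the possible next words and
# their probabilities (all numbers invented for illustration).
next_word = {
    "was":  (["Neil", "Buzz"], [0.95, 0.05]),
    "Neil": (["Armstrong"], [1.0]),
    "Buzz": (["Aldrin"], [1.0]),
}

text = "The first person to walk on the Moon was".split()

# "Just adding one word at a time": repeatedly sample the next word,
# conditioned (in this toy case) only on the current last word.
while text[-1] in next_word:
    words, weights = next_word[text[-1]]
    text.append(random.choices(words, weights=weights)[0])

print(" ".join(text))
```

A real LLM conditions on the entire preceding sequence rather than a single word, but the loop is the same: predict, append, repeat.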

Or to adapt a passage from Professor Murray Shanahan*:  

Suppose we give an LLM the prompt “Who was the first person to walk on the Moon?”, and suppose it responds with “Neil Armstrong”.

What are we really asking here?

From our perspective, we are asking the literal question (“Who was the first person to walk on the Moon?”) in an attempt to elicit an accurate, factual answer (“Neil Armstrong”).

From the perspective of the model (which, again, is just a model), our prompt translates to:

Given the statistical distribution of words in the vast public corpus of (English) text, what words are most likely to follow the sequence “The first person to walk on the Moon was… ”?

That the model responds with “Neil Armstrong” is due to the regular co-occurrence of the two elements—i.e., the words “Neil Armstrong” are statistically most likely to follow the word sequence “The first person to walk on the Moon was”.
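
You can observe this co-occurrence directly with a small open model. The sketch below uses GPT-2 via the Hugging Face transformers library, purely as a stand-in for larger models, to print the tokens the model deems most likely to come next:

```python
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

prompt = "The first person to walk on the Moon was"
inputs = tokenizer(prompt, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits  # shape: (1, sequence_length, vocab_size)

# The model's probability distribution over the very next token.
probs = torch.softmax(logits[0, -1], dim=-1)
top = torch.topk(probs, k=5)
for p, token_id in zip(top.values, top.indices):
    print(f"{tokenizer.decode(token_id.item())!r}: {p.item():.3f}")
```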

The probabilistic nature of these models is what makes them so adaptable. We now have the architecture (transformers), the compute power (Moore’s Law), and the data volumes (the internet) to build giant models that can be adapted relatively quickly to all sorts of different use cases. Before this moment, we were largely building massive IF/THEN deterministic workflows to achieve similar results—often not as good, and almost always far more labor intensive.
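
For contrast, here is a caricature of that IF/THEN style (rules invented for illustration). Every phrasing must be anticipated by hand, and anything unanticipated falls through:

```python
def answer(question: str) -> str:
    q = question.lower()
    # Each rule must be written, tested, and maintained by hand.
    if "first" in q and "moon" in q:
        return "Neil Armstrong"
    if "capital" in q and "france" in q:
        return "Paris"
    return "Sorry, I don't understand the question."

print(answer("Who was the first person to walk on the Moon?"))  # matches a rule
print(answer("Who initially set foot on the lunar surface?"))   # falls through
```

A probabilistic model, by contrast, generalizes across phrasings it was never explicitly programmed to handle.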

On the other hand, the probabilistic nature of these models is also what makes them hallucinate (next section).

Be warned: any analysis that concludes with the observation that the models are probabilistic, prone to hallucination, and therefore not to be relied upon is materially incomplete. First, it assumes that perfect accuracy is the sole standard. In reality, perfection is rarely the standard, and those who presume human infallibility are deluding themselves. Second, it assumes end users will only ever interact with the models in their raw form, completely ignoring the nascent but rapidly maturing application layer.*

For more technical follows, we recommend Simon Willison and Andrew Ng.
