GLOSSARY

Large Language Model (LLM)

An LLM is a transformer-based neural network trained on massive text corpora (Claude, GPT, Gemini, Llama) that generates and comprehends natural language and can call external tools.

Quick answer
A large language model (LLM) is a neural network with billions of parameters trained on massive text corpora to predict the next token in a sequence. Modern LLMs (GPT, Claude, Gemini, Llama) are transformer-based and power chat assistants, code copilots, retrieval systems, and agentic applications. Enterprise deployment options span provider APIs, private cloud, and self-hosted open-weight models.
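"Predict the next token" can be made concrete with a toy sketch. The bigram table below stands in for a real transformer's billions of learned parameters; only the autoregressive loop (predict, append, repeat) is faithful to how LLMs actually generate text.

```python
# Toy next-token predictor: a hand-written bigram table standing in
# for a trained transformer. Illustrative only.
BIGRAMS = {
    "the": {"cat": 0.6, "dog": 0.4},
    "cat": {"sat": 0.9, "ran": 0.1},
    "sat": {"down": 1.0},
}

def generate(prompt: list[str], max_tokens: int = 3) -> list[str]:
    """Autoregressive decoding: repeatedly predict the next token
    from the sequence so far and append it (greedy decoding here)."""
    tokens = list(prompt)
    for _ in range(max_tokens):
        dist = BIGRAMS.get(tokens[-1])
        if not dist:
            break  # no continuation known for this token
        # Pick the highest-probability next token (greedy decoding).
        tokens.append(max(dist, key=dist.get))
    return tokens

print(generate(["the"]))  # ['the', 'cat', 'sat', 'down']
```

Real models sample from the distribution (temperature, top-p) rather than always taking the argmax, which is why the same prompt can yield different outputs.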

WHAT IT IS

Leading LLM families include Anthropic's Claude, OpenAI's GPT series, Google's Gemini, Meta's Llama, and Mistral's open models. Models are described by parameter count, context window length, and capability benchmarks (MMLU, HumanEval, GPQA). Open-weight models can be self-hosted; proprietary models are typically consumed via API with managed guardrails.

HOW IT WORKS

Production LLM applications add structure around the raw model: system prompts, retrieval-augmented generation (RAG) for grounding, tool/function calling, evaluation harnesses (golden datasets, LLM-as-judge), content safety, and cost/latency monitoring. A good LLM product is 90% the plumbing, 10% the model choice.

WHEN TO USE

Deploy an LLM when the workload is language-heavy, pattern-rich, and tolerates probabilistic output with human review. Avoid it for deterministic rules where traditional software is cheaper and safer.

RELATED QUESTIONS

What is a large language model?
A large language model (LLM) is a neural network with billions of parameters trained on vast corpora of text (and increasingly code, images, and audio) to predict the next token in a sequence. Modern LLMs are transformer-based and power chat assistants, code copilots, retrieval systems, and agentic applications.
What is the difference between GPT, Claude, Gemini, and Llama?
GPT is OpenAI's family. Claude is Anthropic's. Gemini is Google's. Llama is Meta's open-weight family. They differ on training data, post-training methods (RLHF, constitutional AI, etc.), context window size, tool-use capability, pricing, and availability. Most enterprises use more than one.
How large is 'large' in practice?
Current frontier models have 100B+ parameters (often much more, though many labs no longer publish counts). Open-weight models range from 1B–400B+ parameters. Size correlates with capability, but post-training technique, data quality, and retrieval now matter at least as much as raw parameter count.
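Parameter count translates directly into serving hardware via a back-of-envelope rule: weight memory is roughly parameters times bytes per parameter. The sketch below ignores activations and the KV cache, so treat it as a lower bound, not a sizing guide.

```python
def weight_memory_gb(params_billions: float, bytes_per_param: float) -> float:
    """Rough weight-only memory estimate; excludes activations and KV cache."""
    return params_billions * 1e9 * bytes_per_param / 1e9  # simplifies to params * bytes

# A 70B-parameter model at fp16 (2 bytes/param) needs ~140 GB just for
# weights; 4-bit quantization (0.5 bytes/param) cuts that to ~35 GB.
print(weight_memory_gb(70, 2))    # 140.0
print(weight_memory_gb(70, 0.5))  # 35.0
```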
What are the enterprise deployment options?
API access from a model provider or managed platform (OpenAI, Anthropic, Google, Mistral, AWS Bedrock), private deployment inside a customer VPC (Anthropic via Bedrock, Gemini via Vertex), or self-hosted open-weight models (Llama, Mixtral, Qwen). Each trades cost, latency, control, and compliance differently.
How does NUUN AI choose models for client builds?
We select per use case, not per vendor loyalty. We evaluate on a held-out test set drawn from the client's actual workload, and we design architecture so the model is swappable. Vendor lock-in at the model layer is a liability we avoid by default.
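Making the model swappable is mostly an interface question. One common pattern, sketched here with a hypothetical `ChatModel` protocol, is to let application code depend on a thin provider-agnostic interface so vendor adapters can be exchanged at the composition root.

```python
from typing import Protocol

class ChatModel(Protocol):
    """Provider-agnostic interface: the rest of the stack depends on
    this, never on a vendor SDK directly (hypothetical sketch)."""
    def complete(self, prompt: str) -> str: ...

class EchoModel:
    """Stub used in tests; a real adapter would wrap an OpenAI,
    Anthropic, or Google client behind the same method."""
    def complete(self, prompt: str) -> str:
        return f"echo: {prompt}"

def summarize(model: ChatModel, text: str) -> str:
    # Application code sees only ChatModel, so swapping vendors is a
    # one-line change where the model is constructed.
    return model.complete(f"Summarize: {text}")

print(summarize(EchoModel(), "hello"))  # echo: Summarize: hello
```

The same seam is where per-use-case evaluation plugs in: run the held-out test set against each candidate adapter and pick the winner without touching application code.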