Agent Memory
The capacity of an AI agent to store and retrieve past data in order to enhance how it performs tasks.
AI Reference
Browse a practical dictionary of AI and LLM terminology used across Promptineer.
The capacity of an AI agent to store and retrieve past data in order to enhance how it performs tasks.
AI system that operates on its own, making decisions and interacting with tools as needed.
An advanced RAG setup where the AI agent(s) chooses on its own when to search for external information and how to use it.
AI systems that make decisions on the fly, choosing actions and tools without being manually instructed each step.
The final training phase to align model's actions with human ethics and safety requirements. QA for values and safety.
A component in neural networks that helps the model focus on relevant parts of input data, similar to how humans pay attention to specific aspects of information.
Systematic errors in AI models that can lead to unfair or prejudiced outcomes, often reflecting biases present in training data.
A reasoning strategy where an AI agent/model explains its thinking step-by-step, making it easier to understand and improving performance.
A method where an AI agent completes a task by breaking it into ordered steps and handling them one at a time.
The overall structure that manages how an AI system processes input, decides what to do, and generates output.
The maximum amount of text or tokens that an AI model can process at once, determining how much context it can consider.
A training method where AI model improves by learning from large sets of new, unlabeled data.
Reusable blueprints or strategies for designing effective AI agents.
Compressing a large AI's knowledge into a smaller model by teaching it to mimic predictions.
A numerical representation of text, images, or other data that captures semantic meaning and relationships in a vector space.
A technique where an AI model learns to perform a task with only a few examples, rather than requiring extensive training data.
The process of adapting a pre-trained model to a specific task by training it on a smaller, task-specific dataset.
Allows AI agents to dynamically call external functions based on the requirements of a specific task.
When an AI model generates false or made-up information that appears to be factual, often occurring in text generation.
A design where humans stay involved in decision-making to guide the AI's choices.
Models that blend fast and deep thinking. Adapting their reasoning depth depending on how hard the problem is.
The process of using a trained AI model to make predictions or generate outputs based on new input data.
A structured representation of information that shows relationships between different entities, helping AI systems understand connections.
A compressed representation of data where similar items are closer together, allowing AI models to understand and generate new content.
AI model that creates content like text or images, often used in generative tasks.
Memory that enables an AI to keep and access information across multiple sessions or tasks. What we see in ChatGPT, Claude, etc.
Large Reasoning Models: built for complex, logical problem-solving beyond simple generation.
A protocol that allows AI agents to connect with external tools and data sources using a defined standard, like how USB-C lets devices plug into any compatible port.
A neural network model setup that directs tasks to the most suitable sub-models for better speed and accuracy.
The internal variables that an AI model learns during training, determining how it processes and responds to input data.
A setup where several AI agents work together, each handling part of a task to achieve a shared goal more effectively.
A computing system inspired by biological neural networks, consisting of interconnected nodes that process information.
A machine learning approach where a model learns to recognize new objects or concepts from a single example.
A coordinator that oversees multiple AI agents, assigning tasks and deciding who does what and when. Think about it as a manager of agents.
When an AI agent spends too much time or uses excessive tokens to solve a task often fixed by limiting how deeply it reasons.
Further training of a model after its initial build to improve alignment or performance. Pretty same what's Alignment.
AI's ability to remember how to perform repeated tasks, like following a specific process or workflow it learned earlier.
The practice of carefully crafting input prompts to get desired outputs from AI models, especially in language models.
The process of reducing the precision of model parameters to make AI models more efficient and faster to run.
An approach where an AI agent combines reasoning and acting. First thinking through a problem, then deciding what to do.
AI models that evaluate situations, pick tools, and plan multi-step actions based on context.
A method where an AI agent looks back at its previous choices to improve how it handles similar tasks in the future.
Method of training where AI learns by trial and error, receiving rewards or penalties.
Uses human feedback to shape the model's behavior through rewards and punishments.
A technique where model pulls in external data to enrich its response and improve accuracy or get a domain expertise.
A strategy where an AI system sends parts of a task to the most suitable agent or model based on what's needed.
When an AI agent identifies its own errors and fixes them automatically. No human involvement or help needed.
A training approach where models learn from unlabeled data by creating their own learning objectives.
A form of memory in AI that holds information briefly during one interaction or session.
A method where AI agents or models are required to return responses in a specific format, like JSON or XML, so their outputs can be reliably used by other systems, tools or can be just copy/pasted elsewhere.
Training AI model with labeled data to specialize in specific tasks and improve performance.
A parameter that controls the randomness of AI model outputs, with higher values leading to more creative but potentially less focused responses.
A technique that lets an AI agent adjust how deeply it thinks at runtime, depending on how complex the task is.
The basic unit of text that an AI model processes, which can be a word, part of a word, or a character.
External services or utilities that an AI agent can use to carry out specific tasks it can't handle on its own. Like web search, API calls, or querying databases.
Using knowledge gained from solving one problem to help solve a different but related problem.
A type of neural network architecture that uses attention mechanisms to process sequential data, forming the basis of many modern LLMs.
A specialized database that stores and retrieves vector embeddings, enabling efficient similarity search for AI applications.
Agents built for a specific field like legal, healthcare, or finance, so they perform better in those domains.
A parameter in a neural network that determines the strength of connections between neurons.
Predefined logic or code paths that guide how AI system, models and tools interact to complete tasks.
The ability of an AI model to perform tasks it hasn't been explicitly trained on, using its general understanding.