Glossary of AI: Common Artificial Intelligence Terms

This AI list of terms and definitions helps businesses and customers regardless of their technical expertise, to comprehend the significance of generative AI. Let’s explore some of the common AI terms and definitions that are crucial for understanding a number of AI-related concepts.

AI (Artificial Intelligence)

Simulation of human cognitive skills in order to perform tasks such as problem solving, learning, and self-correcting.

AI Accelerator

Hardware or system designed specifically for the purpose of executing AI-related functions. Examples include NVIDIA GPUs.

Algorithm

Instruction sequence designed for AI training, problem-solving, learning, and reasoning.

Ampere Architecture

NVIDIA’s GPU design made specifically for accelerating AI, big data analytics, and HPC.

Bare Metal

A server that is physically located on a user’s premises as opposed to being in a data center.

Benchmark

A reference point or standard measurement used for comparison.

Big Data

Large volumes of data that require complex processing in order to extract meaningful information about patterns.

Black Box

An AI system whose decision-making mechanism is opaque despite being able to produce desirable results.

Cluster

Several computers connected together in order to perform specific tasks as a single system.

Colocation

The practice of placing privately-owned servers at a co-location facility.

Compute

Processing capability offered in the cloud or through data centers.

Containerization

Virtualization at the application level. Enables isolation of apps in order to run them on different OSs.

Conversational AI

Technology capable of sustaining a conversation by utilizing natural language processing (NLP).

CUDA (Compute Unified Device Architecture)

Parallel computing software created by NVIDIA. It enables GPUs to perform general computing operations.

CUDA Core

Processing element responsible for executing CUDA instructions on NVIDIA GPUs.

Deep Learning

ML discipline which utilizes multi-layer neural networks in order to process and analyze data with high levels of abstraction.

DGX System

NVIDIA’s AI supercomputers capable of performing deep learning.

Finetuning

The process of improving the performance of an already trained AI model on a specific set of data.

FP16 (Half Precision Floating Point)

Standardized format of binary representation of real numbers which utilizes 16 bits. It is used for training AI models.

FP32 (Single Precision Floating Point)

Format of binary representation of real numbers encoded with 32 bits. Used in gaming and training AI models.

GDDR Memory

Video memory type which is typically found in GPUs. It is used for high-bandwidth data access needed in gaming and AI.

Generative AI

An AI paradigm which enables machines to create new data such as text, images, and videos from existing data.

GPU (Graphics Processing Unit)

Processor which is designed to perform parallel processing tasks. It is commonly used in AI and ML.

HPC (High-Performance Computing)

Broad discipline which focuses on using supercomputers to solve complex problems by employing large amounts of processing power.

Hypervisor

Computer software which enables a guest operating system to run on another host computer.

InfiniBand (IB)

Networking standard which enables high-performance computing, particularly large-scale parallel processing.

Internet of Things (IoT)

Network of physical objects which are connected and exchange data over the internet. AI enhances IoT devices’ capabilities, such as enabling them to make predictions.

IOPS (Input/Output Operations Per Second)

Metric used in measuring the performance of storage devices.

Kubernetes

Open-source software which enables the management of workloads running in containers.

Latency

Delay which occurs between the time that information is sent and the time that it is received.

LLM (Large Language Model)

An AI program which is capable of understanding and generating realistic-sounding text.

Load Balancing

The process of distributing workloads among resources in a manner that ensures optimal use.

Machine Learning (ML)

AI discipline which enables computer systems to learn from data using a set of mathematical algorithms.

MIG (Multi-Instance GPU)

Technology which allows for partitioning of GPUs into instances.

Natural Language Processing

AI technology which enables computers to understand, interpret, and generate human languages.

NIC (Network Interface Controller)

A circuit which manages a computer’s communication with other devices connected to the network.

Node

An autonomous computer in a network or cluster of computers which processes and stores data.

NVLink

Interconnect technology developed by NVIDIA which enables faster data transfer between GPUs and a CPU.

NVSwitch

Technology developed by NVIDIA which provides high-speed connectivity between GPUs at the switch level.

OOB (Out of Band)

Technique of managing network devices by using networks which are different from data networks.

Optimization

The process of maximizing performance by altering certain parameters of a system.

Pre-training

The process of training AI models which allows for the acquisition of general knowledge.

RAID (Redundant Array of Independent Disks)

Technology which enables combining several hardware disks into one logical unit for data storage purposes.

SLA (Service Level Agreement)

Agreement which stipulates the standards which a service provider must uphold.

SLM (Small Language Model)

Language model designed to perform simple language-related tasks.

Tensor

Mathematical object which is used in linear algebra. They can be represented in the form of tensors, hence the name tensor processing unit (TPU).

Tensor Core

Specialized processing core which is in NVIDIA GPUs and is used for AI processing. It performs key functions, such as matrix multiplication.

TensorRT

High-performance deep learning inference library from NVIDIA.

Text-to-Image Generator

AI paradigm which transforms text commands into an image.

Text-to-Speech

AI technology which converts text into speech.

Throughput

Measurement of how much information can be processed by a system per unit of time. Its standard unit is bits per second (bps).

Virtualization

Technology which allows for the emulation of a computer system in another computer system.

Volta Architecture

GPU design by NVIDIA which is used in most AI applications due to containing tensor cores.

Table of Content

Conclusion

Generative AI has the potential to turbo-charge employees, processes, customers and your services. Generative AI is so powerful that it can make almost every aspect of your business more efficient. Sales, service, marketing, and commerce are already leveraging the power of generative AI to bring more innovative and customised solutions to market, faster. By harnessing AI to handle the day-to-day of supporting customers, you can free up time for humans to drive creativity and collaboration, building those vital connections that only people can provide.