Skip to content
DEE
Database Engineering Essentials

[DEE-5] Glossary

INFO

Common terms used across DEE documents.

Database Fundamentals

TermDefinition
ACIDAtomicity, Consistency, Isolation, Durability -- properties guaranteeing reliable transaction processing
BASEBasically Available, Soft state, Eventually consistent -- alternative to ACID for distributed systems
CAP TheoremA distributed system can guarantee at most two of: Consistency, Availability, Partition tolerance
DDLData Definition Language -- SQL statements that define schema (CREATE, ALTER, DROP)
DMLData Manipulation Language -- SQL statements that manipulate data (SELECT, INSERT, UPDATE, DELETE)
SchemaThe structure of a database: tables, columns, types, constraints, and their relationships
TransactionA unit of work that is either fully completed or fully rolled back

Relational Concepts

TermDefinition
NormalizationProcess of organizing data to reduce redundancy and improve integrity
Primary KeyA column (or set of columns) that uniquely identifies each row in a table
Foreign KeyA column that references the primary key of another table, enforcing referential integrity
IndexA data structure that improves query speed at the cost of additional storage and write overhead
JoinAn operation combining rows from two or more tables based on a related column
Surrogate KeyAn artificial key (e.g., auto-increment ID, UUID) with no business meaning
Natural KeyA key derived from real-world data (e.g., email, SSN)

NoSQL Concepts

TermDefinition
Document StoreDatabase storing data as semi-structured documents (e.g., MongoDB, CouchDB)
Key-Value StoreDatabase storing data as key-value pairs (e.g., Redis, DynamoDB)
Column-Family StoreDatabase organizing data into column families (e.g., Cassandra, HBase)
Graph DatabaseDatabase using graph structures with nodes, edges, and properties (e.g., Neo4j)
Eventual ConsistencyA consistency model where replicas converge to the same state over time

Operations

TermDefinition
ReplicationCopying data across multiple database instances for availability and read scaling
ShardingDistributing data across multiple databases based on a partition key
MigrationA versioned change to the database schema
Connection PoolA cache of database connections maintained for reuse