A data management system for storing, versioning, and serving machine learning features consistently across training and inference. Feature stores ensure that multimodal AI systems use the same feature computation logic in development and production.
A feature store provides a centralized registry of feature definitions, a computation engine that materializes features from raw data, and a serving layer that provides features at training and inference time. Features are computed once and reused across models, ensuring consistency. The store handles both batch features (computed periodically) and real-time features (computed on demand).
Platforms include Feast (open-source), Tecton, Hopsworks, and SageMaker Feature Store. Storage backends split between offline stores (S3, BigQuery for training) and online stores (Redis, DynamoDB for inference). Feature definitions include transformation logic, data sources, entity keys, and serving parameters. Point-in-time correct joins prevent data leakage during training. Feature freshness SLAs govern update frequency.