Editing
Ai Infrastructure Mlops
(section)
Jump to navigation
Jump to search
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
== <span style="color: #FFFFFF;">Remembering</span> == * '''MLOps''' β Machine Learning Operations; practices and tools for deploying and maintaining ML models in production reliably and efficiently. * '''ML pipeline''' β An automated sequence of steps: data ingestion β preprocessing β training β evaluation β deployment β monitoring. * '''Feature store''' β A centralized repository for computing, storing, and serving ML features consistently across training and inference. * '''Model registry''' β A central catalog tracking model versions, metadata, performance metrics, and deployment status. * '''Experiment tracking''' β Recording hyperparameters, metrics, artifacts, and code for each training run to enable comparison and reproducibility. * '''Model serving''' β The infrastructure for running trained models and serving predictions via API or batch processing. * '''Inference server''' β A system optimized for serving model predictions at high throughput and low latency (Triton, TorchServe, vLLM). * '''Continuous Training (CT)''' β Automatically retraining models on fresh data to prevent performance degradation. * '''Data drift''' β A change in the statistical distribution of input data over time, degrading model performance. * '''Concept drift''' β A change in the relationship between input features and the target variable, requiring model retraining. * '''CI/CD for ML''' β Automated testing and deployment pipelines adapted for ML workflows. * '''GPU cluster''' β A collection of GPUs used for distributed model training and inference. * '''Kubernetes''' β A container orchestration platform used to deploy and scale ML services. * '''Ray''' β A distributed computing framework for Python ML workloads. * '''Weights & Biases (W&B)''' β A popular experiment tracking and model management platform. * '''Kubeflow''' β An ML workflow orchestration platform built on Kubernetes. </div> <div style="background-color: #006400; color: #FFFFFF; padding: 20px; border-radius: 8px; margin-bottom: 15px;">
Summary:
Please note that all contributions to BloomWiki may be edited, altered, or removed by other contributors. If you do not want your writing to be edited mercilessly, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource (see
BloomWiki:Copyrights
for details).
Do not submit copyrighted work without permission!
Cancel
Editing help
(opens in new window)
Navigation menu
Personal tools
Not logged in
Talk
Contributions
Create account
Log in
Namespaces
Page
Discussion
English
Views
Read
Edit
View history
More
Search
Navigation
Main page
Recent changes
Random page
Help about MediaWiki
Tools
What links here
Related changes
Special pages
Page information