Boost ML model serving with dynamic batching and CPU/GPU pipelines for maximum efficiency.

Mosec is a high-performance framework designed for serving machine learning models efficiently. It leverages dynamic batching and CPU/GPU pipelines to maximize your compute resources. Built with Rust, it ensures fast web layer and task coordination, while offering a user-friendly Python interface. Mosec supports cloud deployment with features like model warmup and Prometheus monitoring, easily managed by Kubernetes. With its focus on online serving, it allows you to concentrate on model optimization and business logic. Installation is straightforward via pip or conda, and it supports various ML frameworks. Mosec is ideal for those looking to enhance their ML model serving capabilities in a scalable and efficient manner.
+4 more
+4 more