Jalansh Munshi
Large scale deployment
Multi GPU Deployment for a Large AI Model
Using Ray Serve to deploy a large AI model across multiple GPUs as an API endpoint.
Jalansh Munshi
Last updated on Dec 15, 2024
7 min read