Learn the ins and outs of efficiently serving Large Language Models (LLMs). Dive into optimization techniques, including KV caching and Low-Rank Adapters (LoRA), and gain hands-on experience with Predibase's LoRAX inference server.
Topics: text-generation, batch-processing, server-optimization, model-serving, model-acceleration, inference-optimization, optimization-techniques, machine-learning-operations, deep-learning-techniques, model-inference-service, performance-enhancement, scalability-strategies, serving-infrastructure, large-scale-deployment
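For orientation, here are minimal sketches of the two techniques named in the description, written in plain PyTorch. They are illustrative only: the rank `r`, scaling `alpha`, and toy dimensions are invented for the example, and they are not the course notebooks' or LoRAX's actual implementation.

```python
# LoRA in a nutshell: the pretrained weight is frozen and augmented by a
# trainable low-rank update (alpha / r) * B @ A, so fine-tuning touches only
# the small matrices A and B. Hyperparameters here are illustrative.
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    def __init__(self, in_features: int, out_features: int, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = nn.Linear(in_features, out_features)
        for p in self.base.parameters():      # freeze the pretrained layer
            p.requires_grad_(False)
        self.A = nn.Parameter(torch.randn(r, in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(out_features, r))  # zero-init: no change at step 0
        self.scaling = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # base output plus the scaled low-rank correction
        return self.base(x) + self.scaling * (x @ self.A.T @ self.B.T)

layer = LoRALinear(64, 64)
print(layer(torch.randn(2, 64)).shape)  # torch.Size([2, 64])
```

KV caching avoids re-encoding the whole prefix on every decode step: each step's key and value vectors are stored, and the new query attends over the accumulated cache.

```python
# Single-head KV cache sketch: each decode step appends its key/value to the
# cache, and the new query attends over everything seen so far.
import torch

class KVCache:
    def __init__(self):
        self.keys, self.values = [], []

    def step(self, q: torch.Tensor, k: torch.Tensor, v: torch.Tensor) -> torch.Tensor:
        self.keys.append(k)
        self.values.append(v)
        K = torch.stack(self.keys)     # (t, d): all keys seen so far
        V = torch.stack(self.values)   # (t, d)
        attn = torch.softmax(q @ K.T / K.shape[-1] ** 0.5, dim=-1)  # (1, t)
        return attn @ V                # (1, d)

cache, d = KVCache(), 8
for _ in range(4):                     # four decode steps reuse the cache
    out = cache.step(torch.randn(1, d), torch.randn(d), torch.randn(d))
print(out.shape)  # torch.Size([1, 8])
```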