Generative AI on Kubernetes: Operationalizing Large Language Models 1st Edition

★★★★★ 4.2 19 reviews

$55.54
Price when purchased online
Free shipping Free 30-day returns

Sold and shipped by hasur-hasur.com
We aim to show you accurate product information. Manufacturers, suppliers and others provide what you see here.



Product details

Management number 219166531
Release Date 2026/05/03
List Price $22.22
Model Number 219166531

Generative AI is revolutionizing industries, and Kubernetes has fast become the backbone for deploying and managing these resource-intensive workloads. This book serves as a practical, hands-on guide for MLOps engineers, software developers, Kubernetes administrators, and AI professionals ready to combine AI innovation with the power of cloud native infrastructure. Authors Roland Huß and Daniele Zonca provide a clear road map for training, fine-tuning, deploying, and scaling GenAI models on Kubernetes, addressing challenges like resource optimization, automation, and security along the way.

With actionable insights and real-world examples, readers will learn to tackle the opportunities and complexities of managing GenAI applications in production environments. Whether you're experimenting with large-scale language models or facing the nuances of AI deployment at scale, you'll uncover the expertise you need to operationalize this exciting technology effectively.

Learn how to deploy LLMs more efficiently with optimized inference runtimes
Get hands-on with GPU scheduling, including hardware detection and multinode scaling
Monitor and understand LLM-specific metrics like Time to First Token and token throughput
Know when to fine-tune a model or when retrieval augmentation is the better choice
Discover how to evaluate models with standardized benchmarks before committing GPU resources
Learn to run agentic applications with secure tool integration, identity management, and persistent state

ISBN10 1098171926
ISBN13 978-1098171926
Edition 1st
Language English
Publisher O'Reilly Media
Dimensions 7 x 2 x 9.19 inches
Item Weight 1.54 pounds
Print length 404 pages
Publication date April 7, 2026


Customer ratings & reviews

4.2 out of 5
★★★★★
19 ratings | 8 reviews
5 stars 78% (15)
4 stars 6% (1)
3 stars 3% (1)
2 stars 2% (0)
1 star 11% (2)

There are currently no written reviews for this product.