LIBRISTO
LIBROAMANTO
mandatory
Become part of a community of book lovers from all over the world and get access to a whole bunch of benefits. Create an account for free
0
Free delivery for purchases over 69.99 €
DPD courier 5.99 Bpost point 7.99 Bpost 7.49 DPD point 3.49 GLS courier 4.99

Free delivery for orders over 69.99 euro.

Hands-On LLM Serving and Optimization

Hosting LLMs at Scale

Language EnglishEnglish
E-book Adobe ePub DRM
Publishers O'Reilly Media, April 2026
Large language models (LLMs) are the reasoning engines of modern AI. Today, a major inflection point... Full description
? points 168 b Top Top New New
69.51
In stock Immediate digital delivery


Customers also purchased


Top
Beyond Vibe Coding Addy Osmani / E-book Adobe ePub DRM
common.buy 61.20
Top
Bayesian Statistics The Fun Way Will Kurt / Book Paperback
common.buy 30.60
Top
Bayesian Analysis with Python Osvaldo Martin / Book Paperback
common.buy 56.34
Top
Steal Like an Artist Austin Kleon / Book Paperback
common.buy 11.75
Top
Krótka historia Europy Simon Jenkins / Book Hardback
common.buy 14.99
Dona nobis pacem Klaus Heizmann / Book Paperback
common.buy 22.89
Stitch! Best Friends Forever! Tony Toshimitsu Tran / Book Paperback
common.buy 8.00

Large language models (LLMs) are the reasoning engines of modern AI. Today, a major inflection point has arrived: as the world races to deploy AI at scale, model inference has moved to the center of the stack. Welcome to the inference era. Without proper optimization, however, LLMs can be expensive and slow to serve. Hands-On LLM Serving and Optimization is a comprehensive guide to the complexities of deploying and optimizing LLMs at scale.In this hands-on, engineering-focused book, authors Chi Wang and Peiheng Hu combine practical examples, code, and strategies for building robust, performant, and cost-efficient AI token factories. Whether you re building the LLM inference infrastructure or the applications that consume it, a deep understanding of LLM serving will make you a more effective, future-ready engineer as AI transforms how we work and build.Learn the foundations of model serving with core concepts, design paradigms, and industry best practicesUnderstand the common challenges of hosting LLMs at scaleBalance latency and throughput to meet the demands of AI applications and business requirementsHost LLMs cost-effectively with practical, code-backed techniques

Actress & Polyglot
EWA KASP for
Play video
Ewa Kasp
Libristo has the largest selection of foreign-language books. That’s why I buy my books there.

About the book

Full name Hands-On LLM Serving and Optimization
Language English
Binding E-book - Adobe ePub DRM
Date of issue 2026
EAN 9798341621466
Libristo code 52376258
Publishers O'Reilly Media
Give this book today
It's easy
1 Add to cart and choose Deliver as present at the checkout 2 We'll send you a voucher 3 The book will arrive at the recipient's address

You might also be interested in


Christianity the Religion of Nature Andrew P. Peabody / Book Paperback
common.buy 28.27
Bayesian Analysis with Python Osvaldo Martin / Book Paperback
common.buy 49.75
Another Time, Another Place Jeanie I Larson / Book Paperback
common.buy 13.47
Do You Really Want to Meet Velociraptor? Annette Bay Pimentel / Book Hardback
common.buy 40.12
The Dancing Partner Jerome K Jerome / Book Paperback
common.buy 15.90
Outcry Harold Schechter / Book Paperback
common.buy 22.79

Login

Log in to your account. Don't have a Libristo account? Create one now!

 
mandatory
mandatory

Don’t have an account? Discover the benefits of having a Libristo account!

With a Libristo account, you'll have everything under control.

Create a Libristo account
Book advisor Libroamiko
Hi, I'm Libroamiko, can I help?