HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
61.
▲
67% Cost Savings with PD Disaggregation Using Ray and vLLM on AMD MI325X
anyscale.com
discuss
7 days ago
robertnishihara
4 points
62.
▲
Loading Llama-2 70B 20x faster with Anyscale Endpoints
anyscale.com
discuss
3 years ago
robertnishihara
4 points
63.
▲
Cloud Infrastructure for LLM and Generative AI Applications
anyscale.com
discuss
3 years ago
ameerh
4 points
64.
▲
Model Batch Inference in Ray: Actors, ActorPool, and Datasets
anyscale.com
discuss
4 years ago
jsd_dmatrix
4 points
65.
▲
Ben Lorica blog post on enterprise applications of reinforcement learning
anyscale.com
discuss
6 years ago
dwampler
4 points
66.
▲
Ant Group – scaling to 1.37M QPS on Ray
anyscale.com
1 comment
4 years ago
george_123
3 points
67.
▲
Ant Group Uses Ray to Build a Large-Scale Online Serverless Platform
anyscale.com
1 comment
4 years ago
jsd_dmatrix
3 points
68.
▲
High Performance Distributed Inference with Ray Serve LLM
anyscale.com
discuss
5 days ago
robertnishihara
3 points
69.
▲
Joins and Hash-Shuffle in Ray Data
anyscale.com
discuss
a year ago
robertnishihara
3 points
70.
▲
An OSS Stack for AI Compute: Kubernetes + Ray + PyTorch + LLM
anyscale.com
discuss
a year ago
gabe_monroy
3 points
71.
▲
Building Rag-Based LLM Applications for Production
anyscale.com
discuss
3 years ago
robertnishihara
3 points
72.
▲
Anyscale Private Endpoints and Anyscale Endpoints Fine-Tuning
anyscale.com
discuss
3 years ago
robertnishihara
3 points
73.
▲
Loading Llama-2 70B 20x faster with Anyscale Endpoints
anyscale.com
discuss
3 years ago
george_123
3 points
74.
▲
How to build a LLM search engine using a self-hosted LLM
anyscale.com
discuss
3 years ago
richardliaw
3 points
75.
▲
An informal introduction to reinforcement learning
anyscale.com
discuss
4 years ago
robertnishihara
3 points
76.
▲
Why Third Generation ML Platforms Are More Performant
anyscale.com
discuss
5 years ago
robertnishihara
3 points
77.
▲
The Third Generation of Production ML Architectures
anyscale.com
discuss
5 years ago
robertnishihara
3 points
78.
▲
Ray 1.0
anyscale.com
discuss
6 years ago
robertnishihara
3 points
79.
▲
Enterprise Applications of Reinforcement Learning
anyscale.com
discuss
6 years ago
robertnishihara
3 points
80.
▲
Major upgrades to Ray Serve: 88% lower latency and 11.1x higher throughput
anyscale.com
1 comment
3 months ago
robertnishihara
2 points
81.
▲
Mobile App Marketing Tool
appscue.com
discuss
12 years ago
appscue
2 points
82.
▲
Data Processing Is Becoming a GPU Workload
anyscale.com
discuss
7 days ago
robertnishihara
2 points
83.
▲
Massively Parallel Agentic Simulations with Ray
anyscale.com
discuss
9 months ago
robertnishihara
2 points
84.
▲
Native LLM APIs in Ray Data and Ray Serve
anyscale.com
discuss
a year ago
robertnishihara
2 points
85.
▲
Large-Scale Deployment of Ray in Tencent's Weixin AI Infrastructure
anyscale.com
discuss
a year ago
robertnishihara
2 points
86.
▲
Anyscale Appoints Keerti Melkote as CEO
anyscale.com
discuss
2 years ago
dnnssl2
2 points
87.
▲
Canva Built a Modern AI Platform Using Anyscale
anyscale.com
discuss
2 years ago
robertnishihara
2 points
88.
▲
Building RAG-Based LLM Applications for Production
anyscale.com
discuss
2 years ago
robertnishihara
2 points
89.
▲
Comparing LLM Performance: Introducing the Open Source Leaderboard for LLM APIs
anyscale.com
discuss
3 years ago
robertnishihara
2 points
90.
▲
Anyscale Endpoints: JSON Mode and Function Calling Features
anyscale.com
discuss
3 years ago
robertnishihara
2 points
More