HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
61.
▲
Loading Llama-2 70B 20x faster with Anyscale Endpoints
anyscale.com
1 comment
3 years ago
fgfm
1 points
62.
▲
Cross-language, distributed model inference framework: Serve with Java API
anyscale.com
1 comment
4 years ago
jsd_dmatrix
1 points
63.
▲
Building Highly Available and Scalable Online Applications on Ray at Ant Group
anyscale.com
1 comment
5 years ago
robertnishihara
1 points
64.
▲
LLM Inference with Ray: Expert parallelism and prefill/decode disaggregation
anyscale.com
discuss
7 months ago
mycelia
1 points
65.
▲
LLM Engine Orchestration for Performance
anyscale.com
discuss
9 months ago
mycelia
1 points
66.
▲
Deploy DeepSeek‑R1 with VLLM and Ray Serve on Kubernetes
anyscale.com
discuss
10 months ago
robertnishihara
1 points
67.
▲
An Open Source Stack for AI Compute: Kubernetes and Ray and PyTorch and VLLM
anyscale.com
discuss
a year ago
robertnishihara
1 points
68.
▲
Open Source RL Libraries for LLMs
anyscale.com
discuss
a year ago
robertnishihara
1 points
69.
▲
An Open Source Stack for AI Compute: Kubernetes and Ray and PyTorch and VLLM
anyscale.com
discuss
a year ago
robertnishihara
1 points
70.
▲
Uv and Ray: Pain-Free Python Dependencies in Clusters
anyscale.com
discuss
a year ago
robertnishihara
1 points
71.
▲
Direct Preference Optimization with Synthetic Data on Anyscale
anyscale.com
discuss
2 years ago
robertnishihara
1 points
72.
▲
Building an LLM Router for High-Quality and Cost-Effective Responses
anyscale.com
discuss
2 years ago
robertnishihara
1 points
73.
▲
End-to-End LLM Workflows Guide
anyscale.com
discuss
2 years ago
GokuMohandas
1 points
74.
▲
Fine-tuning LLMs for longer context and better RAG systems
anyscale.com
discuss
2 years ago
robertnishihara
1 points
75.
▲
RAG at Scale: 10x Cheaper Embedding Computations with Anyscale and Pinecone
anyscale.com
discuss
2 years ago
robertnishihara
1 points
76.
▲
Anyscale Endpoints: JSON Mode and Function Calling Features
anyscale.com
discuss
3 years ago
tosh
1 points
77.
▲
LLM summarization: A case study of human, Llama-2, & GPT-4 summarization quality
anyscale.com
discuss
3 years ago
robertnishihara
1 points
78.
▲
Building Rag-Based LLM Applications for Production
anyscale.com
discuss
3 years ago
akbarnama
1 points
79.
▲
Anyscale Endpoints: LLM inference and fine-tuning
docs.endpoints.anyscale.com
discuss
3 years ago
robertnishihara
1 points
80.
▲
ByteDance Scales Offline Inference with Multi-Modal LLMs to 200 TB Data
anyscale.com
discuss
3 years ago
jamesblonde
1 points
81.
▲
Ray solves common production challenges for generative AI infrastructure
anyscale.com
discuss
3 years ago
tim_sw
1 points
82.
▲
Training One Million Machine Learning Models in Record Time with Ray
anyscale.com
discuss
4 years ago
robertnishihara
1 points
83.
▲
Gang Scheduling Ray Clusters on K8s with Multi-Cluster-App-Dispatcher (MCAD)
anyscale.com
discuss
4 years ago
jsd_dmatrix
1 points
84.
▲
Redis in Ray: Past and Future
anyscale.com
discuss
4 years ago
chandlergibb
1 points
85.
▲
Ray 1.11 Released
anyscale.com
discuss
4 years ago
chandlergibb
1 points
86.
▲
Introducing Anyscale: The Future Is Distributed
anyscale.com
discuss
5 years ago
robertnishihara
1 points
87.
▲
Analyzing memory management and performance in Dask-on-Ray
anyscale.com
discuss
5 years ago
robertnishihara
1 points
88.
▲
Parallelizing Python Code
anyscale.com
discuss
5 years ago
sebg
1 points
89.
▲
Ant Group's Resource Allocation System has scaled over 6000 cores
anyscale.com
discuss
5 years ago
Corgipower12
1 points
90.
▲
Ask HN: Is PySPark a Dead-End?
9 comments
5 years ago
passer_byer
9 points
More