Search: hazyresearch.stanford.edu | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

31.

FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores

hazyresearch.stanford.edu

3 years ago

3 points

32.

HyenaDNA: Learning from DNA with 1M token context

hazyresearch.stanford.edu

3 years ago

3 points

33.

AI's Linux Moment: An Open-Source AI Model Love Note

hazyresearch.stanford.edu

3 years ago

3 points

34.

Minions: The rise of small, on-device LMs

hazyresearch.stanford.edu

a year ago

2 points

35.

Zoology 1: Measuring and Improving Recall in Efficient Language Models

hazyresearch.stanford.edu

3 years ago

2 points

36.

Pixelated Butterfly

hazyresearch.stanford.edu

3 years ago

2 points

37.

Stuffing MLPs Full of Facts: A Generative Approach to Factual Recall

hazyresearch.stanford.edu

7 months ago

2 points

38.

ThunderMittens for Your ThunderKittens

hazyresearch.stanford.edu

7 months ago

2 points

39.

An Unserious Person's Take on Axiomatic Knowledge in the Era of Foundation Models

hazyresearch.stanford.edu

2 years ago

2 points

40.

An Unserious Take on Axiomatic Knowledge in the Era of Foundation Models

hazyresearch.stanford.edu

2 years ago

2 points

41.

Linearizing LLMs with LoLCATs

hazyresearch.stanford.edu

2 years ago

2 points

42.

Efficient language models as arithmetic circuits

hazyresearch.stanford.edu

2 years ago

2 points

43.

Combining Continuous-Time, Recurrent, and Convolutional Models

hazyresearch.stanford.edu

3 years ago

2 points

44.

HyenaDNA: Learning from DNA with 1M token context

hazyresearch.stanford.edu

3 years ago

2 points

45.

Hyena Hierarchy: Towards Larger Convolutional Language Models

hazyresearch.stanford.edu

3 years ago

2 points

46.

From Deep to Long Learning?

hazyresearch.stanford.edu

3 years ago

2 points

47.

Based: An Educational and Effective Sequence Mixer

hazyresearch.stanford.edu

3 years ago

1 point

48.

ThunderKittens 2.0: even faster kernels for your GPUs

hazyresearch.stanford.edu

4 months ago

1 point

49.

Loads and Loads of Fluffy Kittens

hazyresearch.stanford.edu

7 months ago

1 point

50.

Intelligence per Watt: A Study of Local Intelligence Efficiency

hazyresearch.stanford.edu

7 months ago

1 point

51.

Cartridges: Store long contexts in tiny caches with LLM self-study

hazyresearch.stanford.edu

a year ago

1 point

52.

Mind the Trust Gap: Fast, Private Local-to-Cloud LLM Chat

hazyresearch.stanford.edu

a year ago

1 point

53.

Correcting and Improving LLM Predictions Without Labels

hazyresearch.stanford.edu

3 years ago

1 point

54.

FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning

hazyresearch.stanford.edu

3 years ago

1 point

55.

H3: Language Modeling with State Space Models and (Almost) No Attention

hazyresearch.stanford.edu

3 years ago

1 point

56.

Hyena Hierarchy: Towards Larger Convolutional Language Models

hazyresearch.stanford.edu

3 years ago

1 point

57.

Chris Re: Is AI Rare or Everywhere?

hazyresearch.stanford.edu

3 years ago

1 point

58.

Hyena Hierarchy: Towards Larger Convolutional Language Models

hazyresearch.stanford.edu

3 years ago

1 point

59.

Can Longer Sequences Help Take the Next Leap in AI?

hazyresearch.stanford.edu

4 years ago

1 point

60.

HiPPO: Recurrent Memory with Optimal Polynomial Projections

hazyresearch.stanford.edu

5 years ago

1 point