LiteLlama-460M-1T has 460M parameters trained with 1T tokenshuggingface.co54 pointsdmezzetti2 years ago