ML research datasets from ArXiv and Semantic Scholar (JSONL, quality-scored) huggingface.co 3 points dangerlego510 days ago