AI2 Unveils Largest Open Dataset for Training Language Models

The Allen Institute for AI (AI2) has unveiled an expansive open dataset named "Dolma."

By Jace Dela Cruz