AI research nonprofit EleutherAI releases the Common Pile v0.1, an 8TB dataset of licensed and open-domain text for AI models that it says is one of the largest (Kyle Wiggers/TechCrunch)

Kyle Wiggers / TechCrunch: AI research nonprofit EleutherAI releases the Common Pile v0.1, an 8TB dataset of licensed and open-domain text for AI models that it says is one of the largest  —  EleutherAI, an AI research organization, has released what it claims is one of the largest collections of licensed and open-domain text for training AI models.

AI research nonprofit EleutherAI releases the Common Pile v0.1, an 8TB dataset of licensed and open-domain text for AI models that it says is one of the largest (Kyle Wiggers/TechCrunch)

Kyle Wiggers / TechCrunch:
AI research nonprofit EleutherAI releases the Common Pile v0.1, an 8TB dataset of licensed and open-domain text for AI models that it says is one of the largest  —  EleutherAI, an AI research organization, has released what it claims is one of the largest collections of licensed and open-domain text for training AI models.

This article has been sourced from various publicly available news platforms around the world. All intellectual property rights remain with the original publishers and authors. Unshared News does not claim ownership of the content and provides it solely for informational and educational purposes voluntarily. If you are the rightful owner and believe this content has been used improperly, please contact us for prompt removal or correction.