Listen

Description

Thousands of YouTube video transcripts were possibly used to train AI models without consent. The data set included transcripts from popular channels and shows, as well as transcripts from well-known YouTubers. The dataset was created by a non-profit organization called Luther AI, who aims to accelerate AI development by allowing open access to data. The use of this data set is protected under fair use, according to the AI companies involved. However, there is still ambiguity around the licensing and legality of using AI tools and AI generation.

Keywords

YouTube, video transcripts, AI models, consent, data set, Luther AI, fair use, licensing, legality

Takeaways

Links:

https://www.proofnews.org/apple-nvidia-anthropic-used-thousands-of-swiped-youtube-videos-to-train-ai/

https://www.theatlantic.com/technology/archive/2023/08/books3-ai-meta-llama-pirated-books/675063/

Follow Digital Pop:

⁠⁠⁠⁠⁠⁠⁠⁠⁠LinkedIn⁠⁠⁠⁠⁠⁠⁠⁠⁠

⁠⁠⁠⁠⁠⁠⁠⁠⁠YouTube⁠⁠⁠⁠⁠⁠⁠⁠⁠

⁠⁠⁠⁠⁠⁠⁠⁠⁠X (Twitter)⁠⁠⁠⁠⁠⁠⁠⁠⁠

⁠⁠⁠⁠⁠⁠⁠⁠⁠Facebook⁠⁠⁠⁠⁠⁠⁠⁠⁠

⁠⁠⁠⁠⁠⁠⁠⁠⁠Threads⁠