Listen

Description

In our latest episode, we sit down with Derek Tu, Founder and CEO of Carbon, a cutting-edge ETL tool designed specifically for large language models (LLMs).

Carbon is streamlining AI development by providing a platform for integrating unstructured data from various sources, enabling businesses to build innovative AI applications more efficiently while addressing data privacy and ethical concerns.

Derek Tu:

Nicolay Gerold:

Key Takeaways:

00:00 Introduction and Optimizing Embedding Models

03:00 The Evolution of Carbon and Focus on Unstructured Data

06:19 Customer Progression and Target Group

09:43 Interesting Use Cases and Handling Different Data Representations

13:30 Chunking Strategies and Normalization

20:14 Approach to Chunking and Choosing a Vector Database

23:06 Tech Stack and Recommended Tools

28:19 Future of Carbon: Multimodal Models and Building a Platform

Carbon, LLMs, RAG, chunking, data processing, global customer base, GDPR compliance, AI founders, AI agents, enterprises