Artwork

Contenuto fornito da Nicolay Gerold. Tutti i contenuti dei podcast, inclusi episodi, grafica e descrizioni dei podcast, vengono caricati e forniti direttamente da Nicolay Gerold o dal partner della piattaforma podcast. Se ritieni che qualcuno stia utilizzando la tua opera protetta da copyright senza la tua autorizzazione, puoi seguire la procedura descritta qui https://it.player.fm/legal.
Player FM - App Podcast
Vai offline con l'app Player FM !

Building the database for AI, Multi-modal AI, Multi-modal Storage | S2 E10

44:54
 
Condividi
 

Manage episode 446500349 series 3585930
Contenuto fornito da Nicolay Gerold. Tutti i contenuti dei podcast, inclusi episodi, grafica e descrizioni dei podcast, vengono caricati e forniti direttamente da Nicolay Gerold o dal partner della piattaforma podcast. Se ritieni che qualcuno stia utilizzando la tua opera protetta da copyright senza la tua autorizzazione, puoi seguire la procedura descritta qui https://it.player.fm/legal.

Imagine a world where data bottlenecks, slow data loaders, or memory issues on the VM don't hold back machine learning.

Machine learning and AI success depends on the speed you can iterate. LanceDB is here to to enable fast experiments on top of terabytes of unstructured data. It is the database for AI. Dive with us into how LanceDB was built, what went into the decision to use Rust as the main implementation language, the potential of AI on top of LanceDB, and more.

"LanceDB is the database for AI...to manage their data, to do a performant billion scale vector search."

“We're big believers in the composable data systems vision."

"You can insert data into LanceDB using Panda's data frames...to sort of really large 'embed the internet' kind of workflows."

"We wanted to create a new generation of data infrastructure that makes their [AI engineers] lives a lot easier."

"LanceDB offers up to 1,000 times faster performance than Parquet."

Change She:

LanceDB:

Nicolay Gerold:

00:00 Introduction to Multimodal Embeddings
00:26 Challenges in Storage and Serving
02:51 LanceDB: The Solution for Multimodal Data
04:25 Interview with Chang She: Origins and Vision
10:37 Technical Deep Dive: LanceDB and Rust
18:11 Innovations in Data Storage Formats
19:00 Optimizing Performance in Lakehouse Ecosystems
21:22 Future Use Cases for LanceDB
26:04 Building Effective Recommendation Systems
32:10 Exciting Applications and Future Directions

  continue reading

32 episodi

Artwork
iconCondividi
 
Manage episode 446500349 series 3585930
Contenuto fornito da Nicolay Gerold. Tutti i contenuti dei podcast, inclusi episodi, grafica e descrizioni dei podcast, vengono caricati e forniti direttamente da Nicolay Gerold o dal partner della piattaforma podcast. Se ritieni che qualcuno stia utilizzando la tua opera protetta da copyright senza la tua autorizzazione, puoi seguire la procedura descritta qui https://it.player.fm/legal.

Imagine a world where data bottlenecks, slow data loaders, or memory issues on the VM don't hold back machine learning.

Machine learning and AI success depends on the speed you can iterate. LanceDB is here to to enable fast experiments on top of terabytes of unstructured data. It is the database for AI. Dive with us into how LanceDB was built, what went into the decision to use Rust as the main implementation language, the potential of AI on top of LanceDB, and more.

"LanceDB is the database for AI...to manage their data, to do a performant billion scale vector search."

“We're big believers in the composable data systems vision."

"You can insert data into LanceDB using Panda's data frames...to sort of really large 'embed the internet' kind of workflows."

"We wanted to create a new generation of data infrastructure that makes their [AI engineers] lives a lot easier."

"LanceDB offers up to 1,000 times faster performance than Parquet."

Change She:

LanceDB:

Nicolay Gerold:

00:00 Introduction to Multimodal Embeddings
00:26 Challenges in Storage and Serving
02:51 LanceDB: The Solution for Multimodal Data
04:25 Interview with Chang She: Origins and Vision
10:37 Technical Deep Dive: LanceDB and Rust
18:11 Innovations in Data Storage Formats
19:00 Optimizing Performance in Lakehouse Ecosystems
21:22 Future Use Cases for LanceDB
26:04 Building Effective Recommendation Systems
32:10 Exciting Applications and Future Directions

  continue reading

32 episodi

Tutti gli episodi

×
 
Loading …

Benvenuto su Player FM!

Player FM ricerca sul web podcast di alta qualità che tu possa goderti adesso. È la migliore app di podcast e funziona su Android, iPhone e web. Registrati per sincronizzare le iscrizioni su tutti i tuoi dispositivi.

 

Guida rapida