I’ve just released an episode with Sonam Pankaj. She works on EmbedAnything. We have recorded this episode at Berlin Buzzwords back in June, where I also got the chance to test my new audio recording gear (RØDE Wireless GO II).
EmbedAnything is an infrastructure layer, that allows you to embed anything (different text formats, but also other modalities, like audio), written in Rust for performance reasons. It can embed a pdf text 40x faster than in Python.
We spoke about this project, but also about metric learning, quality assurance and multimodality.
There are a bunch of show notes with different papers and projects — do check them out.
Find the episode on these platforms in addition to YouTube:
RSS: https://rss.com/podcasts/vector-podcast/1663042/
Spotify: https://open.spotify.com/episode/5pUWz19iWKHqUzNT0JQ9KL
Apple Podcasts: https://podcasts.apple.com/fi/podcast/berlin-buzzwords-2024-sonam-pankaj-embedanything/id1587568733?i=1000670040161
Big thanks to @srbhr for designing the thumbnail of this episode.