The integration of AI models into web applications is revolutionizing user experiences, but the large size of these models poses a significant challenge for performance. Slow loading times and unresponsive interactions can frustrate users and hinder adoption. This talk will delve into the critical aspect of efficient AI model caching in Chrome, exploring strategies to optimize model delivery and ensure seamless user experiences.
We'll cover:
-
The performance implications of large AI models in web applications.
-
Best practices for configuring cache headers to leverage browser caching effectively.
-
In-depth comparison of client-side caching techniques: Cache API, Origin Private File System (OPFS), and IndexedDB.
-
Advanced strategies for handling user-selected models and large file downloads.
-
Security considerations and trade-offs when caching AI models locally.
Whether you're a web developer, AI engineer, or performance enthusiast, this talk will equip you with the knowledge and tools to deliver lightning-fast AI-powered web experiences. Learn how to optimize model loading, reduce latency, and create web applications that seamlessly integrate AI capabilities without compromising performance.
Agenda
---
Speaker
Rabimba Karanjai - Rice University (Graduate Researcher)
Rabimba Karanjai is a full-time graduate researcher, part-time hacker, and FOSS enthusiast. He works with I2C lab at University of Houston and Habenaro lab at Rice University as part of his PhD. He has previously worked with Mozilla Research Mixed Reality team and IBM Research Systems group. He is a Google Developer Expert in Web Technology and Machine Learning, Google Champions Innovator in C…
Hosted By
Aan Patel, GDG Organizer
Hi! 👋
I'm a full-stack software developer. In my free time, I enjoy playing with new technologies, building prototypes, and connecting with others on platforms such as this one to grow my skills and share knowledge.
Thanks for stopping by. Visit https://aanpatel.tech to learn more about me or to get in touch!
Tobenna Okunna, Organizer
Hi all, my name is Tobenna Okunna. I build apps, primarily aimed at addressing material needs and improving outcomes for under-served communities.
Im looking forward to collaborating and learning with the engineering community.
A little bit about me:
- Originally from Nigeria
- Graduated w/ a CS degree from the University of San Diego in 2020
- Lead teams at multiple startup
- I have tons of side projects
- Played D1 football
- I'm an avid weightlifter
Jasmine Gallaway, Organizer
Complete your event RSVP here: https://gdg.community.dev/events/details/google-gdg-chapel-hill-presents-turbocharging-web-apps-efficient-ai-model-caching-in-chrome/.