Is your data infra struggling to meet the performance demands of #AI and #machinelearning workloads? This technical whitepaper demonstrates critical architectural considerations for optimizing data access in enterprise AI infrastructure. You will learn: 🐢 Common data access challenges that slow down your GPUs 🔍 Why NFS/NAS may not be your best choice 💯 Reference architecture on AWS and benchmarks test results Read Now: https://buff.ly/49aLMDD #gpu #GTC #GTC24 #nfs #nas #modeltraining #llm #storage #compute #machinelearning #AI #ML #genAI
Alluxio
Software Development
San Mateo, California 3,566 followers
Open Source Data Orchestration for Analytics and Machine Learning in the Cloud
About us
Proven at global web scale in production for modern data services, Alluxio is the developer of open source data orchestration software for the cloud. Alluxio moves data closer to big data and machine learning compute frameworks in any cloud across clusters, regions, clouds and countries, providing memory-speed data access to files and objects. Intelligent data tiering and data management deliver consistent high performance to customers in financial services, high tech, retail and telecommunications. Alluxio is in production use today at seven out of the top ten internet companies. Venture-backed by Andreessen Horowitz and Seven Seas Partners, Alluxio was founded at UC Berkeley’s AMPLab by the creators of the Tachyon open source project. For more information, contact info@alluxio.com.
- Website
-
https://www.alluxio.io/
External link for Alluxio
- Industry
- Software Development
- Company size
- 51-200 employees
- Headquarters
- San Mateo, California
- Type
- Privately Held
- Founded
- 2015
Locations
-
Primary
1825 S Grant St
Suite 800
San Mateo, California 94402, US
Employees at Alluxio
Updates
-
📹 In this video, Hope Wang, developer advocate at Alluxio, unravels the evolution and rising popularity of open-source data lakes for big data analytics and AI workloads. Gain insights into: 💡 The evolution of data platform architectures 📈 Trends in cloud-native approaches ✨ The importance of efficient data retrieval and bandwidth for analysis Click to watch now: https://buff.ly/3QkR9sz #DataCouncil #data #io #ai #modeltraining #machinelearning #storage #compute #anlytic #cloud
Tackling I/O Challenges in Modern Data Lakes
https://www.youtube.com/
-
Prefill in LLM inference is known to be resource-intensive, especially for long LLM inputs. While better scheduling can mitigate prefill’s impact, it would be fundamentally better to avoid (most of) prefill. Mark your calendars for “Reducing Prefill for LLM Serving in RAG”, an enlightening session by Junchen Jiang, Assistant Professor of computer science at University of Chicago where He will introduce how to speed up prefill delay while maintaining the same generation quality by improving the loading process of the reused KV cache. Learn More: #data #ai #llm #gpu #NVIDIA #modeltraining #machinelearning #deeplearning #storage #compute #caching #dataplatform #cloud
AI/ML Infra Meetup at Uber | Alluxio
https://www.alluxio.io
-
Don’t miss our coming up AI/ML Infra Meetup, a hybrid community event happening at Uber Sunnyvale and on zoom. Introducing the last talk for the night, “My Perspective on Deep Learning Framework”, presents by Xiande Cao, Senior Deep Learning Software Engineer Manager at #NVIDIA. From Caffe to MXNet, to PyTorch, he will share his perspective on the evolution of deep learning frameworks. Learn More:
AI/ML Infra Meetup at Uber | Alluxio
https://www.alluxio.io
-
Slow #data access and low #GPU utilization can bottleneck end-to-end machine learning pipelines as training data volume grows and when large model files are more commonly used for serving. These challenges are prevalent when using popular frameworks like #PyTorch, #Ray, or #HuggingFace, paired with cloud object storage solutions like #S3 or #GCS. Join us at AI/ML Infra Meetup at Uber where Lu Qiu and Siyuan Sheng will offer comprehensive insights into improving speed and GPU utilization for model training and serving. RSVP:
AI/ML Infra Meetup at Uber | Alluxio
https://www.alluxio.io
-
👏 We are honored to be recognized as one of the leading companies in the sphere of big data management and integration tools, as acknowledged in part 4 of the 2024 CRN Big Data 100 list. Resides between compute and storage, Alluxio brings data closer to accelerate heavy duty analytics/ai workload. Read this article to learn more:
The Coolest Big Data Management And Integration Tool Companies Of The 2024 Big Data 100
crn.com
-
🙌 Can't wait to meet you all at Trino Fest 2024!!
Trino Fest will be awesome - thanks to our speakers, attendees and also to our sponsors. Today we welcome Alluxio as sponsor. Thank you for supporting the fest and the community! https://lnkd.in/g8_yRNGY Register now and send your props to to Starburst, Onehouse, Cloudinary, and Upsolver, and now also Alluxio
A sneak peek of Trino Fest 2024
trino.io
-
🚗 #Uber builds and maintains one of the largest scale ML infrastructure, with over 1000 pipelines daily, for training an extensive number of models being used across various aspects of the business. Join Qiushen (Eric) Wang, Software Engineer at Uber’s Michelangelo team at our AI/ML Infra Meetup for his session “Optimizing Data Pipeline on Uber’s ML Platform” to learn challenges introduced by such a massive scale of data pipeline. You can expect: 🚀 Enhancing Data Access Speed & Efficiency 🔝 How to maximize CPU & GPU utilization in training epochs 🤝 Shared data infrastructure for collaborative project efficiency #data #ai #llm #gpu #NVIDIA #modeltraining #machinelearning #deeplearning #storage #compute #caching #dataplatform #cloud
AI/ML Infra Meetup at Uber | Alluxio
https://www.alluxio.io
-
The third part of our multi-cloud series webinar, “Cloud-Native Model Training on Distributed Data” is only one week away! Don’t miss the chance to embrace hybrid or multi-cloud architecture for large-scale #analytics and #AI workloads. Save your spot now: https://buff.ly/3U1rq9F #cloud #multicloud #hybridcloud #compute #storage #modeltraining #machinelearing
Welcome! You are invited to join a webinar: Alluxio Monthly Webinar | Multi-Cloud Webinar Series: Cloud-Native Model Training on Distributed Data. After registering, you will receive a confirmation email about joining the webinar.
-
🌟 Our exciting hybrid community event, AI Infra Meetup will be back on Thursday, May 9! 📢 Co-hosted by Alluxio and Uber, we bring leading AI/ML infrastructure experts to Uber Sunnyvale, where they will give talks and share insights about optimizing data pipelines, accelerating model training and serving, and designing scalable architectures. Mark your calendar for this premier opportunity to engage and discuss the latest AI/ML trends with industry professionals from Uber, Nvidia, UChicago, and more, and immerse yourself with learning, networking and conversations. 🍺 Food and Drinks are on us! 🍕 Learn More: https://buff.ly/4b2XQI1 #meetup #al #ml #machinelearning #infra #storage #compute #gpu #llm #modeltraining
AI/ML Infra Meetup at Uber | Alluxio