Details
Wednesday April 9, 2025 at Thrive Workplace Centennial(9200 E Mineral Ave, Centennial, CO 80112)
5:30-6:00: Food, Soda, Beer, and Networking
6:00-6:15: Announcements
6:15-7:45: Linearly Scalable RAG Data Ingestion on Commodity Infrastructure
The Oracle Coherence team has developed a distributed parallel data ingestion solution for Retrieval Augmented Generation (RAG) use cases. The solution’s linear scalability allows users to choose the speed of data ingestion, including document loading and chunking, and vector embedding creation, by scaling out the Coherence cluster doing the work.
Faster data ingestion allows for more frequent and rapid experimentation with document chunking strategies, and LLM selection for vector embedding creation, to improve answer quality in RAG applications. It also allows for more frequent re-ingestion of changing knowledge corpuses. Initial vetting has validated that the Coherence-based solution is at least an order of magnitude faster and cheaper than the common approach of running custom Python programs on GPUs, which are hard to find. By running on commodity infrastructure, the Coherence-based solution avoids that problem entirely.
Exhibiting architectural flexibility, the Coherence-based solution allows the created vector embeddings to be stored in Oracle Database 23ai, in OpenSearch, in Coherence itself, or in other possible stores, for later searching during generative AI chat sessions. These strengths of the Coherence-based solution – ingestion speed and architectural flexibility – are based on unique Coherence features like distributed parallel processing, grid-side eventing, and data source integration.
If you are implementing RAG features in your company or software, you might be interested in this Coherence-based solution, to optimize the speed and cost of repetitive data ingestion in your use case.
About Randy Stafford
Randy Stafford is the product manager of Oracle Coherence since 2019, after twelve years of consulting and developing on it. He was formerly Chief Architect of IQNavigator in Denver, and a member of Rally Software’s technical advisory board. Randy has contributed to the enterprise architecture patterns literature, with chapters in Martin Fowler’s Patterns of Enterprise Application Architecture, and O’Reilly’s 97 Things Every Software Architect Should Know (also O’Reilly’s 97 Things Every Programmer Should Know, and Floyd Marinescu’s EJB Design Patterns way back in the day). An old Smalltalk programmer, he was an early member of the Domain Driven Design community and has presented at many software conferences over the years.
About Aleks Seović
Aleks Seović is an architect at Oracle, where he works on Oracle Coherence, a leading in-memory data grid product, and contributes to Helidon microservices framework. Most recently, Aleks led the design and implementation of the AI-related features in Coherence, such as low-level vector storage and indexing, as well as higher level features such as scalable document ingestion, vector embeddings creation, and vector search, as part of Coherence RAG. He also leads the implementation of Coherence native clients and microservices framework integrations. Prior to joining Oracle in 2014, Aleks led a boutique consultancy practice, where he worked with customers around the world to help them implement mission critical applications using Oracle Coherence. Aleks is the author of “Oracle Coherence 3.5” (Packt Publishing, 2010) and frequently speaks about and evangelizes Coherence at industry conferences, as well as Java User Group events.
7:45: Door prizes
Thanks to our sponsors!