LLM + Vector Data @ ICDE 2025

International Workshop on Coupling of Large Language Models with Vector Data Management

In conjunction with the 41th IEEE International Conference on Data Engineering (ICDE 2025)

Monday, May 19th 2025, Hong Kong, China

mylogo.jpg

The emergence of generative AI (GenAI) is a major driving force behind the modern data science ecosystem, a field that exploits data as the central asset for actionable insights. Analogously, GenAI is a form of artificial intelligence which learns from massive datasets to generate new data, showcasing human-like creativity in text, images to code, speech, and video. Two critical pillars of the GenAI technology are large language models (LLMs) and vector data. In particular, LLMs are a category of genAI models that emphasize on generating new text contents. On the other hand, there is also an upsurge of dense high-dimensional, billion-scale vector data from deep learning models that embed complex data, e.g., text, multimedia, graphs, and tables into vector representations aiming to preserve semantic similarity. Since LLMs operate on vector data at various stages consisting of pre-training, fine-tuning, inference, and retrievalaugmented generation (RAG), coupling large language models with vector data management is essential for enhancing data science services with cross-modal data querying and generation. It creates new opportunities and challenges in areas such as accuracy, consistency, efficiency, scalability, privacy, fairness, explainability, data regulations, software-hardware collaboration, and cloud-native systems. The workshop aims to advance the understanding of how LLMs and vector data management can cooperatively contribute to data science solutions.

news

Oct 15, 2024 LLM + Vector Data Workshop website released. :sparkles: :smile:
Sep 25, 2024 LLM + Vector Data Workshop proposal accepted.
Sep 14, 2024 LLM + Vector Data Workshop proposal submission.