Advances in Multimodal Information Retrieval and Generation

£49.99

Electronics engineering, Data warehousing, Data mining, Information retrieval, Artificial intelligence, Expert systems / knowledge-based systems, Machine learning, Computer vision, Image processing

Authors: Man Luo, Tejas Gokhale, Neeraj Varshney, Yezhou Yang, Chitta Baral

Collection: Synthesis Lectures on Computer Vision

Language: English

Published by: Springer

Published on: 25th June 2024

Format: LCP-protected ePub

ISBN: 9783031578168


Overview

This book examines state-of-the-art methods in multimodal retrieval, multimodal generation, and the pioneering field of retrieval-augmented generation. Grounded in Transformer-based models, it explores how text and images can be combined and how the connections between them can be interpreted. The authors present current theories, methodologies, and frameworks for multimodal retrieval and generation, aiming to give readers a comprehensive understanding of the current state and future prospects of multimodal AI. Written for students, researchers, practitioners, and AI enthusiasts alike, the book offers the tools needed to understand and apply advanced techniques in multimodal artificial intelligence.
