Foundations, User Modeling, and Common Modality Combinations. Volume 1 EDITORS: Oviatt, Sharon; Schuller, Bjorn; Cohen, Philip R; Sonntag, Daniel; Potamianos, Gerasimos; Krueger, Antonio PUBLISHER: Morgan and Claypool/ACM Press. This is a THREE volume series that presents the definitive state of the art and future directions of the field of Multimodal and Multi-Sensor interfaces.
Natural Language Processing
A case study for contextualised image captioning uning foundation models: journalism enhancement with AI
Large language models (LLMs) and large multimodal models (LMMs) have significantly impacted the AI community, industry, and various economic sectors. In journalism, integrating AI poses unique challenges and opportunities, particularly in enhancing the quality and Read more…