A case study for contextualised image captioning uning foundation models: journalism enhancement with AI

Large language models (LLMs) and large multimodal models (LMMs) have significantly impacted the AI community, industry, and various economic sectors. In journalism, integrating AI poses unique challenges and opportunities, particularly in enhancing the quality and efficiency of news reporting. This study explores how LLMs and LMMs can assist journalistic practice Read more…

Towards self-improving scene understanding with vision-language knowledge integration

Image captioning has seen immense progress in the last few years. However, general-purpose systems often fail to provide personalised, context-aware captions tailored to individual users or domains. In this work, we investigate the task of personalised and contextualised image captioning by leveraging foundational models, including large language models (LLMs) and Read more…

Investigating Natural Language Inference Capabilities of Large Language Modes in Biomedical Claim Verification 

Left: Examples from HealthVer [1]; Right: Example of a claim that is supported and refuted by different evidence [2]  With the rapid growth of biomedical research and the concurrent rise in misinformation, ensuring the accuracy of claims about treatment effectiveness is increasingly critical. Inaccurate or misleading information can have profound Read more…