Mutual Information Preference Optimization for Robust Multi- Modal Recipe Generation

Shaposhnyk M.; Minukhin S.

Будь ласка, використовуйте цей ідентифікатор, щоб цитувати або посилатися на цей матеріал: https://repository.hneu.edu.ua/handle/123456789/40026

Повний запис метаданих

Поле DC	Значення	Мова
dc.contributor.author	Shaposhnyk M.	-
dc.contributor.author	Minukhin S.	-
dc.date.accessioned	2026-05-14T21:17:59Z	-
dc.date.available	2026-05-14T21:17:59Z	-
dc.date.issued	2026	-
dc.identifier.citation	Shaposhnyk M. Mutual Information Preference Optimization for Robust Multi- Modal Recipe Generation / M. Shaposhnyk, S. Minukhin // Сучасні інформаційні технології та системи штучного інтелекту MIT&AIS-2026 : матеріали 2-ї Міжнародної науково-практичної конференції, 27-29 квітня 2026 р. Харків – Яремче, Україна. – Харків, 2026. – С. 113-117.	uk_UA
dc.identifier.uri	https://repository.hneu.edu.ua/handle/123456789/40026	-
dc.description.abstract	This study evaluates the impact of Mutual Information Preference Optimization (MIPO) as a corrective layer within a hybrid vision-language architecture. Rather than introducing a new standalone framework, the research modifies an existing multimodal pipeline by integrating MIPO to bridge the operational gap between a DenseNet-121 ensemble and Llama 3.1 8B. The central hypothesis—that LLMs can act as autonomous semantic filters—was tested through contrastive alignment, which synchronizes CNN-derived visual features with the textual latent space. Experimental results on the Food-101 dataset validate this modification, demonstrating that the system can successfully suppress false-positive detections without a complete retraining of the visual backbone. By filtering out incongruous artifacts through preference optimization, the modified architecture achieved a 60,8% reduction in semantic hallucinations. This confirms the viability of using LLMs for real-time error correction in specialized domains, such as personalized dietetics, where output fidelity is a critical requirement.	uk_UA
dc.language.iso	en	uk_UA
dc.subject	Food Computing	uk_UA
dc.subject	Llama 3.1	uk_UA
dc.subject	DenseNet-121	uk_UA
dc.subject	Mutual Information Preference Optimization	uk_UA
dc.subject	visual grounding	uk_UA
dc.title	Mutual Information Preference Optimization for Robust Multi- Modal Recipe Generation	uk_UA
dc.type	Article	uk_UA
Розташовується у зібраннях:	Статті (ІС)

Файли цього матеріалу:

Файл	Опис	Розмір	Формат
MIT&AIS_2026_main.....pdf		1,54 MB	Adobe PDF	Переглянути/відкрити

Показати базовий опис матеріалу Перегляд статистики

Усі матеріали в архіві електронних ресурсів захищені авторським правом, всі права збережені.