IJFANS International Journal of Food and Nutritional Sciences

Volume 13 Issue 4

Effect of Biostimulants (Azospirillum, Pseudomonas and Bacillus) on the growth and disease suppression of neem Azadirachta indica (A) juss.seedlings
Volume 13 | Issue 4

“Pigments of Imagination & Color Psychology of Consumers towards Apparel: A Perceptual Study”
Volume 13 | Issue 4

Exploring The Relationship Between Weather Patterns and Energy Consumption in Smart Homes: A Regression Analysis
Volume 13 | Issue 4

DEEP LEARNING BASED APPROACH FOR BIRD SPECIES IDENTIFICATION AND CLASSIFICATION
Volume 13 | Issue 4

ML-DRIVEN WASTE CLASSIFICATION FOR EFFECTIVE ORGANIC AND NON-ORGANIC WASTE MANAGEMENT
Volume 13 | Issue 4

Critical Factors for Optimizing Large Multi-modal Models: Mage Resolution and Text Labeling

PDF

Keywords:

U. Harita,Tanaya Ganguly

Abstract

Large Multimodal Models have proven to be remarkably adept at comprehending tasks involving broad vision and language. However, these models frequently face difficulties when handling complex scene understandings and narratives because of the limited supported input resolution (e.g., 448 x 448) and the incomplete description of the training image-text combination.

Issue

Volume 8, Issue 3 (2019 )

Submit article

IJFANS International Journal of Food and Nutritional Sciences

Critical Factors for Optimizing Large Multi-modal Models: Mage Resolution and Text Labeling

Abstract

POLICIES & JOURNAL LINKS

Contact Us

IJFANS International Journal of Food and Nutritional Sciences

Critical Factors for Optimizing Large Multi-modal Models: Mage Resolution and Text Labeling

Article Sidebar

Main Article Content

Abstract

Article Details

POLICIES & JOURNAL LINKS

Contact Us