Image Caption

Name: Image Caption
SKU: DACUPBSGBB3
Price: 59.95 CHF
Availability: InStock

1 available

Free shipping

Delivered between Wed., 22.04.2026 and Thu., 23.04.2026

Details

Image captioning with audio has emerged as a challenging yet promising task in deep learning. This paper proposes a novel approach that integrates convolutional neural networks (CNNs) for image feature extraction with recurrent neural networks (RNNs) for sequential audio analysis. Specifically, we use pre-trained CNNs such as VGG to extract visual features from images, and spectrogram representations coupled with RNNs such as LSTM or GRU to process audio inputs. Our proposed model generates captions based not only on the visual content of the images but also on accompanying audio cues. We evaluate the model on benchmark datasets and demonstrate its effectiveness in generating coherent and contextually relevant captions for images with corresponding audio inputs. Additionally, we conduct ablation studies to analyze the contribution of each modality to overall captioning performance. Our results show that fusing the visual and auditory modalities significantly improves captioning quality compared to using either modality in isolation.
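The two-branch idea in the description can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the description does not say how the modalities are fused, so simple concatenation is used; a plain (Elman) RNN stands in for the LSTM or GRU; random numbers stand in for VGG features and for a spectrogram; and the names `rnn_summarize` and `fuse` are hypothetical. It is not the authors' model.

```python
import math
import random


def rnn_summarize(frames, w_in, w_rec):
    """Run a plain (Elman) RNN over spectrogram frames; return the final hidden state."""
    hidden = [0.0] * len(w_rec)
    for frame in frames:
        # Each unit combines the current frame with the previous hidden state.
        hidden = [
            math.tanh(
                sum(w * x for w, x in zip(w_in[i], frame))
                + sum(w * h for w, h in zip(w_rec[i], hidden))
            )
            for i in range(len(w_rec))
        ]
    return hidden


def fuse(image_features, audio_state):
    """Fuse the two modalities by concatenation (an assumption, see lead-in)."""
    return list(image_features) + list(audio_state)


def random_matrix(rows, cols, rng):
    """Small random weights; a real model would learn these."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
n_bins, hidden_size, image_dim, n_frames = 4, 3, 5, 6

# Stand-in for the feature vector a pre-trained CNN would produce.
image_features = [rng.random() for _ in range(image_dim)]
# Stand-in for a spectrogram: n_frames time steps, n_bins frequency bins each.
spectrogram = [[rng.random() for _ in range(n_bins)] for _ in range(n_frames)]

w_in = random_matrix(hidden_size, n_bins, rng)
w_rec = random_matrix(hidden_size, hidden_size, rng)

audio_state = rnn_summarize(spectrogram, w_in, w_rec)
fused = fuse(image_features, audio_state)
print(len(fused))  # image_dim + hidden_size
```

In a full captioning system, a vector like `fused` would condition a caption decoder; dropping one of the two inputs to `fuse` is roughly what an ablation of that modality would look like.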

About the author
Ms. K. Kanchana is an Assistant Professor in the Computer Science and Engineering Department at Kathir College of Engineering. Her areas of interest are Machine Learning and Deep Learning.

30-day right of return

Further information

General information
- GTIN 09786207647606
- Genre Business Encyclopedias
- Language English
- Number of pages 64
- Publisher LAP LAMBERT Academic Publishing
- Dimensions H 220 mm x W 150 mm x D 4 mm
- Year 2024
- EAN 9786207647606
- Format Paperback
- ISBN 6207647602
- Publication date 16.05.2024
- Title Image Caption
- Author Kanchana Kannaiyan, Meenatchi R
- Subtitle Image Caption using Deep learning
- Weight 113 g
