@image-captioning
No description available 🫠
10.5k
@salesforce
LAVIS - A One-stop Library for Language-Vision Intelligence
3.2k
@OpenGVLab
InternGPT (iGPT) is an open-source demo platform where you can easily showcase your AI models. It now supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)
2.8k
@sgrvinod
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
1.7k
@ttengwang
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
1k
@microsoft
Oscar and VinVL
907
@yunjey
TensorFlow Implementation of "Show, Attend and Tell"
562
@kdexd
[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations
488
@gokayfem
Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation
412
@ufal
An open-source tool for sequence learning in NLP built on TensorFlow.
303
@dabasajay
A neural network that generates captions for an image using a CNN and an RNN with beam search.
148
@tsenghungchen
Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017
64
@tanyuqian
NAACL '24 (Best Demo Paper Runner-Up) / MLSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference
47
@cobanov
Image captioning using Python and BLIP
31
@diviz-mit
A large-scale curated dataset of Visual.ly infographics with metadata and additional crowdsourced annotations for research applications in computer vision and natural language processing.
28
@Aldenhovel
Evaluation tools for image captioning, including BLEU, ROUGE-L, CIDEr, METEOR, and SPICE scores.
11
@ShaunakSen
This is the GitHub repository for my Master's dissertation, "Artificial Intelligence for Web Accessibility", which I completed as part of my MSc in Data Science at the University of Southampton, UK, under the supervision of Prof. Mike Wald.
10
@wangheda
Image caption models using visual attention and reinforcement learning (The 4th place solution to the AIChallenger Contest, Image Caption Track by team xiaoquexing)
6
@parask11
Generates suitable captions for user-supplied images of people and animals.
4
@donydchen
A data-driven query expansion approach for image captioning, implemented in C++
3
@Haoming02
Blazingly fast deep learning inference via TensorRT acceleration