image-captioning

@image-captioning

No description available 🫠

LAVIS

10.5k

@salesforce

LAVIS - A One-stop Library for Language-Vision Intelligence

InternGPT

3.2k

@OpenGVLab

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)

a-PyTorch-Tutorial-to-Image-Captioning

2.8k

@sgrvinod

Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning

Caption-Anything

1.7k

@ttengwang

Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything

Oscar

@microsoft

Oscar and VinVL

show-attend-and-tell

907

@yunjey

TensorFlow Implementation of "Show, Attend and Tell"

virtex

562

@kdexd

[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations

ComfyUI_VLM_nodes

488

@gokayfem

Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation

neuralmonkey

412

@ufal

An open-source tool for sequence learning in NLP built on TensorFlow.

Image-Caption-Generator

303

@dabasajay

A neural network to generate captions for an image using CNN and RNN with BEAM Search.

show-adapt-and-tell

148

@tsenghungchen

Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017

redco

@tanyuqian

NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference

image-captioning

@cobanov

Image captioning using python and BLIP

visuallydata

@diviz-mit

A large-scale curated dataset of Visual.ly infographics with metadata and additional crowdsourced annotations for research applications in computer vision and natural language processing.

bleu-rouge-meteor-cider-spice-eval4imagecaption

@Aldenhovel

Evaluation tools for image captioning. Including BLEU, ROUGE-L, CIDEr, METEOR, SPICE scores.

AI-for-Web-Accessibility

@ShaunakSen

This is the GitHub repository for my Masters dissertation titled: Artificial Intelligence for Web Accessibility which I completed as a part of my MSc in Data Science course in the University of Southampton, UK under the supervision of Prof. Mike Wald

ImageCaption-UnderFitting

@wangheda

Image caption models using visual attention and reinforcement learning (The 4th place solution to the AIChallenger Contest, Image Caption Track by team xiaoquexing)

image-captioner

@parask11

Generates suitable captions for the images of people and animals input by the user.

image-caption-cpp

@donydchen

A data driven query expansion approach for image caption, implemented in cpp

TensorRT-Toolkits

@Haoming02

Blazingly fast deep learning inference via TensorRT acceleration