Here are
239 public repositories
matching this topic...
🔮 The most advanced MLOps platform for multimodal AI on the cloud · Neural Search · Creative AI · Cloud Native
Updated
Nov 28, 2022
Python
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Updated
Nov 22, 2022
Python
🪩 Create Disco Diffusion artworks in one line
Updated
Oct 1, 2022
Python
SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
Updated
Nov 28, 2022
Jupyter Notebook
🧬 The data structure for unstructured multimodal data · Neural Search · Vector Search · Document Store
Updated
Nov 28, 2022
Python
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Updated
Nov 26, 2022
Python
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Updated
Nov 26, 2022
Python
A curated list of Multimodal Related Research.
Updated
Oct 30, 2022
Python
Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。
Updated
Nov 21, 2022
Python
Easily compute clip embeddings and build a clip retrieval system with them
Updated
Nov 20, 2022
Jupyter Notebook
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
CVPR 2019: "Pluralistic Image Completion"
Updated
Jul 29, 2022
Python
Transformers at any scale
Updated
Nov 27, 2022
Python
Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch
Updated
Jul 16, 2022
Python
Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image caption.
Updated
Nov 17, 2022
Python
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Updated
Jun 1, 2022
Python
Open-AI's DALL-E for large scale training in mesh-tensorflow.
Updated
Feb 12, 2022
Python
Platform for Situated Intelligence
A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.
Updated
Feb 8, 2022
Python
Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]
Updated
Jun 14, 2022
Jupyter Notebook
Improve this page
Add a description, image, and links to the
multimodal
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
multimodal
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.