Updated Mar 1, 2023 - Python
multimodal
Here are 279 public repositories matching this topic...
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Updated Mar 1, 2023 - Python
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Updated Feb 2, 2023 - Python
🪩 Create Disco Diffusion artworks in one line
Updated Dec 11, 2022 - Python
SDK for interacting with stability.ai APIs (e.g. Stable Diffusion inference)
Updated Feb 28, 2023 - Jupyter Notebook
Notes for software engineers getting up to speed on new AI developments. Serves as a datastore for lspace.swyx.io writing and product brainstorming, with cleaned-up canonical references under the /Resources folder.
Updated Mar 1, 2023
In-App assistant SDK to build a multimodal conversational UX for websites and web apps (JavaScript, React, Angular, Vue, Ember, Electron)
Updated Feb 12, 2023
Updated Mar 1, 2023 - Python
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Updated Mar 1, 2023 - Python
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.
Updated Feb 2, 2023 - Python
In-App assistant SDK to build a multimodal conversational UX for iOS applications (Swift, Objective-C)
Updated Feb 15, 2023 - Objective-C
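Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center at the IDEA Research Institute, serving as infrastructure for Chinese AIGC and cognitive intelligence.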
Updated Feb 23, 2023 - Python
In-App assistant SDK to build a multimodal conversational UX for Android applications (Java, Kotlin)
Updated Dec 23, 2022
In-App assistant SDK to build a multimodal conversational UX for applications created with Flutter (iOS and Android)
Updated Jan 15, 2023 - Ruby
In-App assistant SDK to build a multimodal conversational UX for applications created with Ionic (React, Angular, Vue)
Updated Feb 2, 2023 - TypeScript
Transformers at any scale
Updated Jan 19, 2023 - Python
Easily compute CLIP embeddings and build a CLIP retrieval system with them
Updated Feb 17, 2023 - Jupyter Notebook
A curated list of multimodal-related research.
Updated Oct 30, 2022 - Python
In-App assistant SDK to build a multimodal conversational UX for Apache Cordova applications
Updated Nov 10, 2022 - Ruby
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
Updated Sep 30, 2022