# multimodal
Here are 125 public repositories matching this topic...
A curated list of Multimodal Related Research.
Updated Jul 8, 2021 - Python
CVPR 2019: "Pluralistic Image Completion"
Updated Sep 20, 2019 - Python
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
Updated Apr 23, 2021
OpenAI's DALL-E for large-scale training in mesh-tensorflow.
transformers
artificial-intelligence
autoregressive
text-to-image
variational-autoencoder
multimodal
Updated Apr 6, 2021 - Python
Platform for Situated Intelligence
streaming
framework
pipelines
artificial-intelligence
stream-processing
perception
component-library
human-robot-interaction
multimodal-interactions
multimodal
Updated Jul 22, 2021 - C#
KDD Cup 2020 Challenges for Modern E-Commerce Platform: Multimodalities Recall (first place)
Updated Jul 22, 2020 - Jupyter Notebook
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
search
retrieval
ranking
clip
multimodality
multimodal-learning
multimodal
activitynet
retrieval-model
msvd
msrvtt
video-text-retrieval
lsmdc
didemo
video-clip-retrieval
Updated Jul 13, 2021 - Python
EPNet: Enhancing Point Features with Image Semantics for 3D Object Detection (ECCV 2020)
Updated Aug 25, 2020 - Python
Fusing Histology and Genomics via Deep Learning - IEEE TMI
genomics
fusion
transcriptomics
pathology
multimodal
histopathology
computational-pathogenomics
pathomic
multimodal-network
mahmoodlab
Updated May 26, 2021 - Jupyter Notebook
Updated Dec 2, 2020 - Python
Baseline for the 5th Baidu and Xi'an Jiaotong University Big Data Competition: Urban Region Function Classification
Updated Jun 20, 2020 - Jupyter Notebook
CVPR 2021: "Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE"
tensorflow
attention
generative-adversarial-networks
inpainting
multimodal
vq-vae
autoregressive-neural-networks
Updated Jul 11, 2021 - Python
[CVPR2020] Unsupervised Multi-Modal Image Registration via Geometry Preserving Image-to-Image Translation
deep-learning
cnn
pytorch
multi-modal
image-registration
affine-transformation
stn
image-to-image-translation
multimodal
deformable-transformation
multi-modal-learning
cvpr2020
registartion
multimodal-image-registration
Updated Aug 2, 2020 - Python
Neural Machine Translation with Universal Visual Representation (ICLR 2020)
Updated Jul 1, 2020 - Python
Updated Oct 6, 2020
Graph Distillation for Action Detection
Updated Jul 15, 2019 - Python
Robust multimodal integration method implemented in PyTorch and TensorFlow
Updated Mar 5, 2021 - Python
TensorFlow implementation of "Deep Multimodal Subspace Clustering Networks"
Updated May 10, 2019 - Python
ADvISER is a flexible framework to encourage task-oriented dialog system research & development
machine-learning
framework
reinforcement-learning
toolkit
dialogue
dialogue-systems
task-oriented-dialogue
multimodal
Updated Jul 23, 2021 - Python
Data generation script for the multi-modal speech separation task on the LRS3 dataset.
Updated Jul 29, 2020 - MATLAB
Open scripts and pipelines from the Multimodal Imaging and Connectome Analysis Lab at the Montreal Neurological Institute
machine-learning
neuroscience
neuroimaging
networks
gradients
connectomics
histology
multimodal
multi-scale
Updated Jul 9, 2021 - MATLAB
CLIP: Connecting Text and Image (Learning Transferable Visual Models From Natural Language Supervision)
Updated Jan 19, 2021 - Python
PyTorch implementation of "Adaptive Co-attention Network for Named Entity Recognition in Tweets" (AAAI 2018)
Updated Jun 9, 2021 - Python
Modality-Transferable-MER, a multimodal emotion recognition model with zero-shot and few-shot abilities.
Updated Apr 23, 2021 - Python
A complete pipeline for BraTS 2020
Updated Aug 6, 2020 - Python
Multimodal Hashtag Prediction with Instagram data and PyTorch (2nd Place at OpenResource Hackathon 2019)
Updated May 21, 2021 - Python
[CVPR 2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
paper
annotations
dataset
vqa
cvpr
vqa-dataset
traffic-events
multimodal
multimodal-deep-learning
cvpr2021
video-reasoning
Updated Jul 23, 2021 - JavaScript
  File "/home/ubuntu/vqa/GMN/mmf/mmf/datasets/builders/visual_genome/dataset.py", line 44, in __init__
    scene_graph_file = self._get_absolute_path(scene_graph_file)
AttributeError: 'VisualGenomeDataset' object has no attribute '_get_absolute_path'
Command that I ran in the shell:
CUDA_VISIBLE_DEVICES="0" mmf_run config=projects/gmn/configs/visual_genome/defaults.yaml model=gm