Skip to content
#

Natural language processing

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Here are 16,920 public repositories matching this topic...

transformers
fcakyon
fcakyon commented Dec 6, 2021

🚀 Feature request

Make CLIPFeatureExtractor (or any FeatureExtractor in general) accept batch of images as torch.Tensor.

Motivation

Currently batch of images as torch.Tensor are not treated as a batch, it has to be a List[torch.Tensor] but it is not the case when using native Pytorch DataLoader. Can we update this line so that it accepts batches as torch.Tensor. Maybe we ca

rasa

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

  • Updated Dec 14, 2021
  • Python
gensim
jina
Stubatiger
Stubatiger commented Nov 29, 2021

ENV

Python 3.9
jina 2.5.0

Describe the bug

If i try to dump an image blob to a io.bytesio object an error is thrown

from jina import Document
import io
d =  Document(uri='steam_data/image_store/8c/5b/8c5b265b9c533636.png')
output = io.BytesIO() 

(
    d
    .load_uri_to_image_blob()
    .dump_ima
datasets
danieldeutsch
danieldeutsch commented Jun 2, 2021

Is your feature request related to a problem? Please describe.
I typically used compressed datasets (e.g. gzipped) to save disk space. This works fine with AllenNLP during training because I can write my dataset reader to load the compressed data. However, the predict command opens the file and reads lines for the Predictor. This fails when it tries to load data from my compressed files.

Created by Alan Turing

Wikipedia
Wikipedia