Issues: explosion/spaCy
Kyrgyz language stopwords
lang / ky
Kyrgyz language data and models
resolved
The issue was addressed / answered
#12632 opened May 13, 2023 by GrossmanLess
NER fails on warning "Token indices sequence length is longer than the specified maximum"
perf / accuracy
Performance: accuracy
#12622 opened May 11, 2023 by schudoku
support pydantic 1.10.x+
third-party
Third-party packages and services
#12611 opened May 8, 2023 by achapkowski
Installation issue on old macOSes for new Korean tokenizer in v4.0 alpha
lang / ko
Korean language data and models
#12416 opened Mar 14, 2023 by BLKSerene
Displacy visualiser only sometimes shows labels
feat / visualizers
Feature: Built-in displaCy and other visualizers
#12411 opened Mar 13, 2023 by goonhoon
Training transformer model goes from score 0.97 to ZERO
bug
Bugs and behaviour differing from documentation
feat / ner
Feature: Named Entity Recognizer
feat / training
Feature: Training utils, Example, Corpus and converters
feat / transformer
Feature: Transformer
perf / memory
Performance: memory use
#12383 opened Mar 8, 2023 by svlandeg
Model suggestions for Danish (Norwegian and Swedish) Transformer
feat / transformer
Feature: Transformer
lang / da
Danish language data and models
lang / nb
Norwegian (Bokmål) language data and models
lang / sv
Swedish language data and models
#12376 opened Mar 7, 2023 by KennethEnevoldsen
Not all displaCy templates can be overridden
enhancement
Feature requests and improvements
feat / visualizers
Feature: Built-in displaCy and other visualizers
#12267 opened Feb 9, 2023 by drnextgis
Incorrect tokenization of dash punctuation in Spanish when not preceded or followed by a space
feat / tokenizer
Feature: Tokenizer
lang / es
Spanish language data and models
#12154 opened Jan 23, 2023 by creolio
Access NEL prediction scores across KB candidates
enhancement
Feature requests and improvements
feat / nel
Feature: Named Entity linking
#12048 opened Jan 3, 2023 by Luis-R-Flores
Mismatched IDs error when using nlp.rehearse with listeners
bug
Bugs and behaviour differing from documentation
feat / textcat
Feature: Text Classifier
training
Training and updating models
#12044 opened Jan 2, 2023 by thomashacker
Doc span group spans aren't adjusted for retokenization
bug
Bugs and behaviour differing from documentation
feat / doc
Feature: Doc, Span and Token objects
feat / tokenizer
Feature: Tokenizer
#12024 opened Dec 24, 2022 by kinghuang
spacy package CLI command accepts list of code_paths, but the others do not
enhancement
Feature requests and improvements
feat / cli
Feature: Command-line interface
feat / ux
Feature: User experience, error messages etc.
#12000 opened Dec 19, 2022 by kinghuang
Inconsistent NER predictions from identical inputs while using ThreadPoolExecutor
reproducibility
Consistency, reproducibility, determinism, and randomness
scaling
Scaling, serving and parallelizing spaCy
third-party
Third-party packages and services
#11868 opened Nov 25, 2022 by pege345
Problem with new dependency checking mechanism in spacy 3.4.2
enhancement
Feature requests and improvements
projects
spaCy projects and project templates
#11773 opened Nov 8, 2022 by b2m
Italian tagger and lemmatizer performance dropped with the new v3.4 version
feat / lemmatizer
Feature: Rule-based and lookup lemmatization
feat / tagger
Feature: Part-of-speech tagger
lang / it
Italian language data and models
perf / accuracy
Performance: accuracy
#11298 opened Aug 12, 2022 by databill86
Tokenizer uses a significant amount of memory compared to the input
feat / doc
Feature: Doc, Span and Token objects
feat / tokenizer
Feature: Tokenizer
perf / memory
Performance: memory use
🔜 v4.0
Related to upcoming v4.0
#11295 opened Aug 11, 2022 by itamarst
Problems and errors in new German lemmatizer (since 3.3.0)
feat / lemmatizer
Feature: Rule-based and lookup lemmatization
lang / de
German language data and models
#10953 opened Jun 13, 2022 by lutz-100worte
Executing a none python script using "Spacy Projects" generates an error
projects
spaCy projects and project templates
windows
Issues related to Windows
#10845 opened May 25, 2022 by dhirajsuvarna
IndexError E040 when using senter
feat / doc
Feature: Doc, Span and Token objects
#10801 opened May 13, 2022 by rutgerjv
Language.factory cannot be a subclass without nlp, name args
feat / pipeline
Feature: Processing pipeline and components
#10611 opened Apr 3, 2022 by BramVanroy
Spacy split the sentence when I try to change the head of a root token
experimental
Experimental components and features
feat / doc
Feature: Doc, Span and Token objects
#10526 opened Mar 19, 2022 by zsozso21
de_dep_news_trf uses 1990s Spelling Convention in Lemmatization
feat / lemmatizer
Feature: Rule-based and lookup lemmatization
lang / de
German language data and models
#9799 opened Dec 3, 2021 by hatzel