Skip to content
#

Machine learning

Machine learning is the practice of teaching a computer to learn. The concept uses pattern recognition, as well as other forms of predictive algorithms, to make judgments on incoming data. This field is closely related to artificial intelligence and computational statistics.

Here are 25,005 public repositories matching this topic...

transformers
penpaperkeycode
penpaperkeycode commented Jun 2, 2022

Feature request

Is the addition of the 'OPTforSequenceClassification' class scheduled?
Is someone handling it?
When adding these functions, I wonder if it is possible to PR one by one, or if I have to PR all classes supported by other models.

Motivation

Added function of OPT class, which is being actively discussed recently

Your contribution

I personally use the forSequenceCla

jeremiedbb
jeremiedbb commented May 25, 2022

PR #22722 introduced a common method for the validation of the parameters of an estimator. We now need to use it in all estimators.

Please open one PR per estimator or family of estimators (if one inherits from another). The title of the PR should mention which estimator it's dealing with and the description of the PR should begin with towards #.

Steps

  • The estimator must define a cl
Easy good first issue Meta-issue

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • Updated Apr 3, 2022
  • Python
VishDev12
VishDev12 commented Jun 4, 2022

What happened + What you expected to happen

When initializing a Ray Trainer, we provide a logdir argument, and the __init__ method of the Trainer stores it as a logdir class variable.

Then, when creating a Trainable with Trainer.to_tune_trainable(), it in-turn calls _create_tune_trainable(), which does not use self.logdir. So when tune_function is defined inside `_create_tu

bug good first issue P3 triage
asaini
asaini commented Oct 1, 2021

Problem

See #3856 . Developer would like the ability to configure whether the developer menu or viewer menu is displayed while they are developing on cloud IDEs like Gitpod or Github Codespaces

Solution

Create a config option

showDeveloperMenu: true | false | auto

where

  • true: always shows the developer menu locally and while deployed
  • false: always sho
enhancement good first issue
pytorch-lightning
awaelchli
awaelchli commented Apr 17, 2022

Proposed refactor

The current import time for the pytorch_lightning package on my machine is several seconds. There are some opportunities to improve this.

Motivation

High import times have an impact on the development and debugging speed.

Benchmark

I benchmarked the import time in two environments:

  1. Fresh environment with pytorch lightning installed, no extras.
jina
divyasreedev
divyasreedev commented Mar 14, 2022

Problem: It is pretty challenging to find resource material and valuable articles, videos and such, and we spend a lot of time searching and finding the appropriate resource for us.

Proposed solution: Faceted search can come a long way when looking for a quick way to find a solution designed for our needs. Ratings on the resource can help us select the best solution based on our search

good first issue
rasa

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

  • Updated Jun 13, 2022
  • Python
AnirudhDagar
AnirudhDagar commented Jan 24, 2022

Although the results look nice and ideal in all TensorFlow plots and are consistent across all frameworks, there is a small difference (more of a consistency issue). The result training loss/accuracy plots look like they are sampling on a lesser number of points. It looks more straight and smooth and less wiggly as compared to PyTorch or MXNet.

It can be clearly seen in chapter 6([CNN Lenet](ht

tensorflow-adapt-track good first issue
datasets
lhoestq
lhoestq commented Jun 28, 2021

As mentioned in huggingface/datasets#2552 it would be nice to improve the error message when a dataset fails to build because there are duplicate example keys.

The current one is

datasets.keyhash.DuplicatedKeysError: FAILURE TO GENERATE DATASET !
Found duplicate Key: 48
Keys should be unique and deterministic in nature

and we could have something

enhancement good first issue
gensim
mpenkov
mpenkov commented Jun 22, 2021

In gensim/models/fasttext.py:

    model = FastText(
        vector_size=m.dim,
        vector_size=m.dim,
        window=m.ws,
        window=m.ws,
        epochs=m.epoch,
        epochs=m.epoch,
        negative=m.neg,
        negative=m.neg,
        # FIXME: these next 2 lines read in unsupported FB FT modes (loss=3 softmax or loss=4 onevsall,
        # or model=3 supervi
bug difficulty easy good first issue fasttext
Alec-Stashevsky
Alec-Stashevsky commented Feb 15, 2022

Thank you for submitting a feature request. Before proceeding, please review MLflow's Issue Policy for feature requests and the MLflow Contributing Guide.

**Please fill in this feature request template to ensure a timely and thorough response.

enhancement good first issue help wanted area/tracking
Wikipedia
Wikipedia