pytorch
Here are 14,806 public repositories matching this topic...
We keep this issue open to collect feature requests from users and hear your voice. Our monthly release plan is also available here.
You can either:
- Suggest a new feature by leaving a comment.
- Vote for a feature request with 👍, or vote against it with 👎. (Remember that developers are busy and cannot respond to every feature request, so vote for the ones that matter most to you!)
- Tell us that
Currently, we rely on AllGatherGrad to compute all_gather on GPUs.
TODO:
- [ ] Extend this class to support TPUs
- [ ] Add tests
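For context, here is a rough sketch of the pattern a gradient-aware all_gather typically follows. The class name and details are assumptions for illustration, not the actual AllGatherGrad implementation, and an already-initialized torch.distributed process group is assumed.

```python
import torch
import torch.distributed as dist

class AllGatherWithGrad(torch.autograd.Function):
    """Differentiable all_gather sketch: gather tensors from every rank in
    forward, return only this rank's slice of the summed gradient in backward."""

    @staticmethod
    def forward(ctx, tensor):
        gathered = [torch.zeros_like(tensor) for _ in range(dist.get_world_size())]
        dist.all_gather(gathered, tensor)
        return torch.stack(gathered)

    @staticmethod
    def backward(ctx, grad_output):
        grad_output = grad_output.contiguous()
        # Sum the gradient contributions from all ranks, then keep the
        # slice corresponding to this rank's original input.
        dist.all_reduce(grad_output, op=dist.ReduceOp.SUM)
        return grad_output[dist.get_rank()]
```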
Change tensor.data to tensor.detach() due to pytorch/pytorch#6990 (comment): tensor.detach() is more robust than tensor.data, because in-place modifications of a detached tensor are still caught by autograd's version counter, whereas modifications made through .data are not tracked at all.
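A small self-contained illustration of that difference (not taken from the issue; exp() is used only because it saves its output for the backward pass):

```python
import torch

x = torch.ones(3, requires_grad=True)
y = x.exp()            # exp() saves its output for backward

y.detach().zero_()     # tracked: bumps the shared version counter
try:
    y.sum().backward() # autograd notices the in-place change and raises
except RuntimeError as err:
    print("detach():", err)

x = torch.ones(3, requires_grad=True)
y = x.exp()
y.data.zero_()         # untracked: the version counter is not updated
y.sum().backward()     # runs, but uses the zeroed values -> wrong gradients
print(".data grad:", x.grad)
```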
Add a new API for converting a model to external data. Today the conversion happens in two steps:

external_data_helper.convert_model_to_external_data(<model>, <all_tensors_to_one_file>, <size_threshold>)
save_model(model, output_path)

We want to add another API which combines the two steps:

save_model_to_external_data(<model>, <output_
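A sketch of what this could look like against the existing onnx API. The combined helper is hypothetical (its name comes from the proposal above, but the signature is an assumption since the proposal is truncated).

```python
import onnx
from onnx.external_data_helper import convert_model_to_external_data

# Today's two-step flow, using existing onnx APIs.
model = onnx.load("model.onnx")
convert_model_to_external_data(
    model,
    all_tensors_to_one_file=True,
    location="model.data",   # file that will hold the raw tensor bytes
    size_threshold=1024,     # only externalize tensors larger than this
)
onnx.save_model(model, "model_external.onnx")

# Hypothetical combined helper along the lines proposed above
# (name from the proposal, signature assumed).
def save_model_to_external_data(model, output_path, location="model.data",
                                all_tensors_to_one_file=True, size_threshold=1024):
    convert_model_to_external_data(
        model,
        all_tensors_to_one_file=all_tensors_to_one_file,
        location=location,
        size_threshold=size_threshold,
    )
    onnx.save_model(model, output_path)
```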
When setting train_parameters to False, we may often also want to disable dropout/batchnorm, in other words, run the pretrained model in eval mode.
We've made a small modification to PretrainedTransformerEmbedder that lets the caller specify whether the token embedder should be forced into eval mode during the training phase.
Do you think this feature might be handy? Should I open a PR?
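A minimal sketch of the idea as a generic wrapper (not AllenNLP's actual PretrainedTransformerEmbedder): when the embedder is frozen, override train() so the wrapped module stays in eval mode even while the surrounding model trains.

```python
from torch import nn

class FrozenEmbedderWrapper(nn.Module):
    """Hypothetical wrapper: keeps a frozen embedder in eval mode so dropout
    and batch/layer-norm statistics stay fixed during training."""

    def __init__(self, embedder: nn.Module, train_parameters: bool = False):
        super().__init__()
        self.embedder = embedder
        self.train_parameters = train_parameters
        if not train_parameters:
            for p in self.embedder.parameters():
                p.requires_grad_(False)

    def train(self, mode: bool = True):
        super().train(mode)
        if not self.train_parameters:
            # Force the frozen embedder back to eval mode regardless of
            # the mode the parent model was switched to.
            self.embedder.eval()
        return self

    def forward(self, *args, **kwargs):
        return self.embedder(*args, **kwargs)
```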
I'm using mxnet for some work, but nothing comes up when I search for an mxnet trial or example.
The current PyTorch implementation ignores the argument split_f in the function train_batch_ch13, as shown below.

def train_batch_ch13(net, X, y, loss, trainer, devices):
    if isinstance(X, list):
        # Required for BERT fine-tuning (to be covered later)
        X = [x.to(devices[0]) for x in X]
    else:
        X = X.to(devices[0])
    ...

TODO: Define the argument `
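One way the ignored argument could be honored (an assumption about the intent of split_f, not the book's actual fix) is to lift the device-placement logic above into a callable that the caller can override:

```python
def default_split_f(X, y, devices):
    """Move a mini-batch to the first device, mirroring the inlined block
    in train_batch_ch13 above (handles the list case used for BERT)."""
    if isinstance(X, list):
        X = [x.to(devices[0]) for x in X]
    else:
        X = X.to(devices[0])
    return X, y.to(devices[0])

# train_batch_ch13(net, X, y, loss, trainer, devices, split_f=default_split_f)
# would then call split_f(X, y, devices) instead of hard-coding devices[0].
```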
Please, could you train ghostnet?
(I don't have the ImageNet dataset.)
cuda requirement
Is it possible to run this on a (recent) Mac, which does not support CUDA? I would have guessed that setting --GPU 0 would avoid any CUDA calls, but it fails:

File "/Users/../Desktop/bopbtl/venv/lib/python3.7/site-packages/torch/cuda/__init__.py", line 61, in _check_driver
    raise AssertionError("Torch not compiled with CUDA enabled")
AssertionError: Torch not compiled with CUDA enabled
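A common guard for this situation (generic PyTorch, not this repo's code) is to pick the device at runtime instead of assuming CUDA is available:

```python
import torch

# Fall back to the CPU when no CUDA device is present (e.g. on a Mac).
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

model = torch.nn.Linear(4, 2).to(device)
x = torch.randn(1, 4, device=device)
print(model(x))
```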
It looks like our --label_smoothing_factor Trainer feature doesn't handle fp16 well. It's a problem for the DeepSpeed ZeRO-3 integration I'm working on right now, since it evaluates in fp16, but it can also be reproduced with the recently added --fp16_full_eval trainer option.
To reproduce:
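The original reproduction steps are cut off here; purely as an illustration of the two options being discussed (the values are assumptions, not the author's repro), they map to the following TrainingArguments fields:

```python
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="out",
    label_smoothing_factor=0.1,  # smoothed cross-entropy loss in the Trainer
    fp16_full_eval=True,         # run evaluation entirely in fp16
)
```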