Skip to content
#

protein-sequences

Here are 167 public repositories matching this topic...

ptynecki
ptynecki commented May 16, 2020

Hey,

I would propose extend list of available models by SeqVec (ELMo-based implementation) which was presented in the Modeling aspects of the language of life through transfer-learning protein sequences paper.

SeqVec model trained on UniRef50 is available at: [SeqVec-model](

enhancement good first issue help wanted wontfix
josruirod
josruirod commented Feb 11, 2021

Hi, and congratulations for the software. I want to give a try, and I managed to successfully install it (the conda environment and the step-by-step process is much appreciated, but there are few inaccuracies in the readme, such as FIND instead of find when running, or where is install.sh or the folder where the compressed files for the external software has to be downloaded).
When running the ex

good first issue
mortonjt
mortonjt commented Jul 3, 2020

We'll need to standardize the CPU / GPU version of needleman-wunsch.

  • Make sure that batching can be handled on the CPU (a hacked version is ok, will need to make sure that dimensions aren't an issue)
  • Make sure that the appropriate flags checking for devices are in place.
good first issue
amazon-sagemaker-protein-classification

Implementation of Protein Classification based on subcellular localization using ProtBert(Rostlab/prot_bert_bfd_localization) model from Hugging Face library, based on BERT model trained on large corpus of protein sequences.

  • Updated Feb 15, 2021
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the protein-sequences topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the protein-sequences topic, visit your repo's landing page and select "manage topics."

Learn more