#

html2text

Here are 19 public repositories matching this topic...

jaytaylor / html2text

Star

Golang HTML to plaintext conversion library

go golang html-emails plaintext html2text

Updated May 10, 2020
Go

weblyzard / inscriptis

Star

A python based HTML to text conversion library, command line client and Web service.

python html client converter library html2text web-service

Updated Jun 16, 2020
HTML

adbar / trafilatura

Star

Open

Check the language, clarity and consistency of documentation

adbar commented Jan 9, 2020

A short version of the documentation is available straight from Github (README.rst) while a more exhaustive one is present in the docs folder and online on trafilatura.readthedocs.io

Several problems could arise:

Non-idiomatic use of English (not quite fluent or natural)
Unclear or inc

Read more

good first issue up for grabs

Open

Test trafilatura on further web pages and report bugs

Open

Read settings from user-provided file

voku / html2text

Star

📝 Html2Text - Convert HTML to formatted plain text, e.g. for text mails

Updated Feb 15, 2019
PHP

RxNLP / nlp-cloud-apis

Star

RxNLP APIs for clustering sentences, extracting topics, counting words & n-grams, extracting text from html or URL, computing similarity between texts and more.

python nlp wrapper natural-language-processing text-mining nlp-apis mashape html2text topic-extraction sentence-clustering opinosis-summarization rxnlp-apis xmashape-key

Updated Jan 24, 2020

zacanger / html2txt

Star

html2text but in node

html markdown cli node html2text

Updated Jun 13, 2020
JavaScript

dreipunktnull / twig-extensions

Star

A collection of useful, generic twig extensions.

twig twig-extension symfony html2text

Updated Jun 22, 2018
PHP

rubix1138 / html2text

Star

html2text Search Command for Splunk

python splunk html2text splunk-enterprise splunk-application splunk-searches

Updated Mar 4, 2019
Python

acapitanelli / web-utils

Star

A few python tools for web and text processing.

web download text fingerprinting html2text duplicate-detection

Updated Sep 2, 2019
Python

cycloidio / docker-image-html2text

Star

Dockerized html2text command-line tool

docker tool html2text

Updated Mar 18, 2019
Makefile

LukaszNiewinski / Microservice-for-retrieving-img-and-text

Star

Microservice for text and images collection for data science purposes.

python api docker flask service docker-compose scrapy html2text

Updated Mar 3, 2020
Python

cycloidio / docker-image-python-html2text

Star

Dockerized Python html2text command-line tool

html docker tool text html2text

Updated Mar 15, 2019
Makefile

erayon / PubMed

Star

This project involves building a robust classifier that classifies whether a document (from abstract content) belongs to cancer class or not.

html xml sklearn nltk xgboost beautifulsoup html2text svm-classifier

Updated Nov 7, 2017
HTML

AbdellatifCHE / Collect_Store_Search

Star

The goal is to create a solution that crawls for articles from a news website (Theguardian), cleanses the response, stores it in a hosted mongo database (MongoDB Atlas), then makes it available to search via an API.

python mongodb pymongo nltk scrapy html2text lemmatization

Updated Mar 3, 2020
Python

oguzhanlarca / web2pcat

Star

html2text and pygments

python osx python-script pygments python3 python-3 system-programming lecture-notes html2text eecs pcat pygmentize sistem-programlama sakarya sakarya-universitesi

Updated Aug 26, 2019
Python

hcq0618 / html-files-to-markdown-files

Star

batch convert html files to mardown files

html html2text mardown

Updated May 17, 2019
Python

importcjj / go-readability

Star

Go package that cleans a HTML page for better readability.

go html golang text extractor text-extraction readability html2text html-extractor

Updated Nov 5, 2019
HTML

afeiship / next-html2text

Star

Strip html to text for next.

html text strip html2text

Updated May 14, 2020
JavaScript

gsdefender / packtpub_telegram_bot

Star

Receive Packt Publishing Ltd. Free Learning updates in Telegram every day

telegram telegram-bot selenium packtpub html2text selenium-python

Updated May 16, 2020
Python

Improve this page

Add a description, image, and links to the html2text topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the html2text topic, visit your repo's landing page and select "manage topics."

You can’t perform that action at this time.