#
tei
Here are 200 public repositories matching this topic...
hand-written dictionaries from the FreeDict project
-
Updated
Jan 11, 2021 - C
Document Layout Analysis resources repos for development with PdfPig.
pdf
csharp
hocr
tei
hocr-documents
alto-xml
table-extraction
page-xml
alto
layout-analysis
document-layout-analysis
xycut
docstrum
pdfpig
xy-cut
recursive-xy-cut
page-segmentation
-
Updated
Mar 20, 2022 - C#
Kitodo.Presentation
solr
openlayers
typo3
jplayer
mods
digital-library
code4lib
tei
tei-xml
iiif
mets
apache-solr
kitodo
alto-xml
solarium
alto
mets-xml
mods-xml
kitodo-presentation
-
Updated
Mar 27, 2022 - PHP
Lili Elbe Digital Archive practicum - learning markup via an engaged markdown community. Visit our wiki!
github
git
markdown
xml
markup-language
creative-commons
course-project
digital-humanities
pedagogy
tei
tei-xml
undergraduate
digital-edition
transgender
undergraduate-students
undergraduate-education
digital-editions
digital-pedagogy
-
Updated
Dec 13, 2020 - CSS
a repository to help introduce and orient students to the GitHub collaboration environment, and to support DH classes.
javascript
css
svg
html
xml
regex
xslt
xpath
xquery
cytoscape
relaxng
tei
networkanalysis
regularexpressions
tei-odd
-
Updated
Aug 22, 2020 - HTML
PhiloLogic4
-
Updated
Mar 25, 2022 - Python
Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project
-
Updated
Feb 19, 2022 - Python
A highly customizable plugin for setting up and activating remote-driven autocompletions of attribute values in the oXygen XML Editor.
-
Updated
Oct 7, 2018 - Java
An Etymological DataBase (v2.1) - described in the LREC paper Methodological Aspects of Developing and Managing an Etymological Lexical Resource: Introducing EtymDB-2.0
database
extract
etymology
tei
wiktionary
cognates
etymology-data
wiktionary-parser
lrec2020
borrowings
-
Updated
Jan 4, 2022 - Perl
a repository for materials related to teaching and writing on technologies of up-conversion and project development with the XML family of languages, featuring regex, XPath, XQuery, XSLT, and Schematron.
-
Updated
Feb 25, 2022 - HTML
-
Updated
Feb 8, 2022 - XSLT
A web based browser for the Standard Music Font Layout
-
Updated
Jan 6, 2022 - XQuery
Tools for Humanities Research and Editing of Ancient Documents
-
Updated
Nov 15, 2018
Automated listing of repos in GitHub with XML files containing teiHeader. Find a project using TEI today!
-
Updated
Mar 27, 2022 - JavaScript
ODD files for documenting the Digital Edition of the Carl-Maria-von-Weber-Gesamtausgabe
-
Updated
Mar 1, 2022 - XSLT
Diachronic Spanish Sonnet Corpus. Canonical and minor authors in Spanish (Europe, America and Asia): 15th to 19th century
-
Updated
Jan 2, 2022
odt ► TEI, extract semantics from offices texts to XML/TEI
-
Updated
Jan 21, 2022 - XSLT
A digital edition of the 24 Probstücke of the Oberclasse by Johann Mattheson.
-
Updated
Mar 22, 2022 - JavaScript
La Biblioteca Electrónica Textual del Teatro en Español (BETTE) es una colección de textos teatrales de la Edad de Plata española, codificada en XML-TEI por el grupo GHEDI (ahora @HDAUNIR) de la Universidad Internacional de la Rioja (UNIR).
-
Updated
Nov 8, 2021 - Jupyter Notebook
Improve this page
Add a description, image, and links to the tei topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the tei topic, visit your repo's landing page and select "manage topics."
I have mostly tested
trafilaturaon a set of English, German and French web pages I had run into by surfing or during web crawls. There are definitely further web pages and cases in other languages for which the extraction doesn't work so far.Corresponding bug reports can either be filed as a list in an issue like this one or in the code as XPath expressions in [xpaths.py](https://github.com