# data-transformation
Here are 130 public repositories matching this topic...
data-science
machine-learning
spark
bigdata
data-transformation
pyspark
data-extraction
data-analysis
data-wrangling
dask
data-exploration
data-preparation
data-cleaning
data-profiling
data-cleansing
big-data-cleaning
data-cleaner
cudf
dask-cudf
Updated Jul 26, 2021 - Python
A block-based API for NSValueTransformer, with a growing collection of useful examples.
Updated Mar 23, 2020 - Objective-C
library
framework
asynchronous
php-development
scalability
porter
data-import
data-transformation
abstraction
durability
Updated Jul 13, 2021 - PHP
Logical Replication extension for PostgreSQL 13, 12, 11, 10, 9.6, 9.5, 9.4 (Postgres), providing much faster replication than Slony, Bucardo or Londiste, as well as cross-version upgrades.
subscription
replication
etl
zero-downtime
postgresql
data-transformation
publish-subscribe
cdc
logical-decoding
data-transport
database-replication
Updated Jun 29, 2021 - C
Microsoft Program Synthesis using Examples SDK is a framework of technologies for the automatic generation of programs from input-output examples. This repo includes samples and sample data for the Microsoft Program Synthesis using Examples SDK.
microsoft
sdk
csharp
dotnet
examples
prose
data-transformation
program-synthesis
synthesis
data-wrangling
Updated Jun 29, 2021 - C#
An open source alternative to Looker built using dbt. Made for analysts ❤️
Updated Jul 27, 2021 - TypeScript
Like Awk but with SQL and table joins
Updated May 27, 2021 - Tcl
Updated May 6, 2021 - TypeScript
Advanced and Fast Data Transformation in R
data-science
cran
r
statistics
time-series
high-performance
data-transformation
scientific-computing
econometrics
rstats
data-analysis
data-manipulation
data-processing
weights
panel-data
weighted
data-aggregation
Updated Jul 24, 2021 - R
Data transformation and utility functions for R
Updated May 12, 2021 - R
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
spark
hadoop
algorithms
data-transformation
pyspark
partitioning-algorithms
mapreduce
data-algorithms
data-partition
mapreduce-algorithm
santa-clara-university
mapreduce-python
pyspark-algorithms-book
Updated May 13, 2021 - HTML
A simple Spark-powered ETL framework that just works 🍺
data-science
machine-learning
framework
scala
big-data
spark
pipeline
etl
data-transformation
data-engineering
dataset
data-analysis
modularization
setl
etl-pipeline
Updated Jul 7, 2021 - Scala
open-source
data-science
data
binder
integration
jupyter
pipeline
etl
engine
data-transformation
jupyterlab
notebooks
conector
Updated Jul 26, 2021 - Python
A curated list of Clojure resources for dealing with domain-specific languages.
Updated Jun 11, 2021
machine-learning
deep-learning
data-transformation
data-visualization
machine-learning-library
machine-learning-api
datasets
data-cleaning
ludwig
data-augmentation
automl
tpot
machine-learning-models
model-compression
model-deployment
autokeras
voice-computing
data-cleaning-pipeline
autopytorch
Updated Jun 23, 2021 - Python
Reference Architectures for Datalakes on AWS
glue
amazon-emr
data-transformation
data-lake
data-catalog
data-analytics
hive-metastore
emr-cluster
ingest-data
Updated May 13, 2020 - HTML
Data transformation toolkit
Updated Jul 21, 2021 - Ruby
Wrangler Transform: A DMD system for transforming Big Data
data-science
big-data
parsing
avro
data-transform
data-transformation
project
transform-data
preparation
transform
wrangle
manipulate-data
cdap
cdap-plugin
data-prep
data-cleansing
Updated Jul 23, 2021 - Java
Serialize PHP variables, including objects, in any format. Supports unserializing them too.
api
php
yaml
serialization
json
php7
json-api
xml
data-transformation
yml
jsonapi
transformer
hal
hal-api
xml-transformation
marshaller
json-transformation
array-transformer
yaml-transformer
jsend-transformer
Updated Jul 26, 2021 - PHP
Daany - .NET DAta ANalYtics: a .NET 5 library implementing DataFrame, time series decomposition, and the BLAS and LAPACK linear algebra routines.
data-transformation
data-frame
ssa
series
iris
dataframe
mkl
data-frames
series-decomposition
mlnet
linear-algebra-routines
calculated-columns
daany-library
Updated Jul 8, 2021 - C#
object flow treatment, data transformation
Updated Jul 22, 2021 - JavaScript
A tool to read CSV files with CSVW metadata and transform them into other formats.
Updated Apr 30, 2019 - Python
Foofah: programming-by-example data transformation program synthesizer
data-transformation
data-wrangling
data-preparation
data-cleaning
combinatorial-search
programming-by-example
inductive-program-synthesis
heursitic
Updated Apr 23, 2018 - CSS
Power Query M functions for working with Tabular Data Packages (Frictionless Data) in Power BI and Excel
excel
csv-files
tabular-data
data-transformation
power-bi
data-acquisition
data-visualization
open-data
data-visualisation
data-analytics
data-analysis
power-query
tabular-data-package
data-package
powerbi
json-table-schema
frictionlessdata
datapackage
Updated Jun 10, 2021 - R
A PHP serialization component focused on performance
Updated May 28, 2020 - PHP
bamboolib - template for creating your own binder notebook
docker
data-science
data-transformation
data-visualization
data-visualisation
data-viz
data-exploration
binder-jupyter-notebook
Updated Jul 15, 2021 - Jupyter Notebook
ivozandhuis commented Feb 16, 2021
Observation
If you download the enriched csv, cow, ratt or rml file, the file is named after the originally uploaded {orgfilename}.csv file. The RDF file you download, however, is called 'result.nt'.
Expected
{orgfilename}.nt
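The expected naming could be sketched roughly as follows. This is only an illustration, not the project's actual code: the helper name `rdf_download_name` and its `extension` parameter are hypothetical, and it assumes the download name is derived purely from the uploaded file's name.

```python
from pathlib import PurePosixPath


def rdf_download_name(uploaded_name: str, extension: str = ".nt") -> str:
    """Hypothetical helper: derive the RDF download name from the upload name.

    Swaps the final suffix of the uploaded file (e.g. ".csv") for the RDF
    extension, so the download is no longer a fixed "result.nt".
    """
    # .name drops any directory part; with_suffix replaces only the last suffix.
    return PurePosixPath(uploaded_name).with_suffix(extension).name


print(rdf_download_name("orgfilename.csv"))
```

With this approach an upload named `orgfilename.csv` would yield `orgfilename.nt`, matching the naming already used for the other download formats.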
Right now the tutorial is coherently designed, tested, and even documented. However, it doesn't build up in a way that's very beginner friendly. It establishes glom's value and then immediately uses it at an intermediate level.
I'd like it if it were a bit more drawn out, using basic features first and then adding a multi-line Coalesce as the