pydata
Here are 87 public repositories matching this topic...
-
Updated
Aug 7, 2021 - Python
The stumpy.snippets feature is now completed in #283 which follows this work:
We have a rough notebook t
Similar to dask/dask#7800, we can replace our tmpfile context manager
with tempfile.TemporaryFile from the standard library.
Brief Description of Fix
When I see docs I found [get_features_targets page](https://pyjanitor-devs.github.io/pyjanitor/reference/janitor.functions/janitor.get_features_targe
-
Updated
Dec 27, 2016
Problem description
Reading a dataset with eager's read functionality raises a ValueError when providing columns.
Example code (ideally copy-pastable)
import pandas as pd
from tempfile import TemporaryDirectory
from functools import partial
from storefact import get_store_from_url
from kartothek.io.eager import store_dataframes_as_dataset, read_dataset_as_data-
Updated
Jan 12, 2018 - HTML
-
Updated
Oct 18, 2016 - Jupyter Notebook
In trying to write tests for #189, I'm finding very difficult to add columns to existing tests, as in some cases like the all_types table, the table is defined in a separate file than the tests and multiple tests try to write to the same table.
Additionally, our test suite doesn't prove that the data that are uploaded are the same as the data downloaded for all types.
We should consider m
-
Updated
Aug 30, 2021 - Python
-
Updated
Jun 10, 2021 - Jupyter Notebook
-
Updated
Jul 30, 2017 - Jupyter Notebook
-
Updated
Jan 17, 2021 - Jupyter Notebook
For association testing and PCA (at least), it may be useful to have a function that imputes dosages/allele counts. With floating point values (i.e. from bgen), this can be very simple as a user, e.g. ds.call_genotype_probability.fillna(ds.call_genotype_probability.mean(dim="samples")). With alternate allele counts having a sentinel integer, it is a little more complicated. The best way t
-
Updated
Aug 15, 2021 - Jupyter Notebook
-
Updated
Jul 2, 2018 - Jupyter Notebook
-
Updated
Aug 14, 2018 - HTML
-
Updated
Sep 24, 2018 - Shell
-
Updated
Feb 28, 2021 - Jupyter Notebook
-
Updated
Nov 16, 2018 - Jupyter Notebook
-
Updated
Sep 14, 2017 - Jupyter Notebook
Improve this page
Add a description, image, and links to the pydata topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the pydata topic, visit your repo's landing page and select "manage topics."
I'm hoping to get an idea of the memory size of a dask.dataframe once I call .compute() on it
My current approach is