Skip to content
#

dataframe

Here are 454 public repositories matching this topic...

eladmw
eladmw commented Aug 13, 2020

Hello,
Considering your amazing efficiency on pandas, numpy, and more, it would seem to make sense for your module to work with even bigger data, such as Audio (for example .mp3 and .wav). This is something that would help a lot considering the nature audio (ie. where one of the lowest and most common sampling rates is still 44,100 samples/sec). For a use case, I would consider vaex.open('Hu

ritchie46
ritchie46 commented Apr 10, 2021

TurboDBC is probably the fastest method to communicate to a db with arrow data. We can implement utility functions for this in Python.

If turbodbc isn't installed we raise an exception. It probably has to be installed with conda, so we cannot add it as optional dependency.

All required arrow interop functions are already available to make this work.

danfojs
dorienh
dorienh commented Apr 12, 2021

This may just be a problem with data, but I am getting blown up sortino ratios. I thought they had to be much smaller.

For instance:

import yfinance as yf 

msft = yf.Ticker("ETH-USD")

def getSortino(h, p):
  sh = []
  for i in range(0, len(h)):
    if i > p:
      a = h['Close'].iloc[i-p:i]
      sh.append(ta.sortino_ratio(a))
    else: sh.append(0)

  return sh
anks7190
anks7190 commented Jan 27, 2021

Hi ,

I am using some basic functions from pyjanitor such as - clean_names() , collapse_levels() in one of my code which I want to productionise.
And there are limitations on the size of the production code base.
Currently ,if I just look at the requirements.txt for just "pyjanitor" , its huge .
I don't think I require all the dependencies in my code.
How can I remove the unnecessary ones ?

pdpipe

Improve this page

Add a description, image, and links to the dataframe topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the dataframe topic, visit your repo's landing page and select "manage topics."

Learn more