Skip to content
#

arrow

Here are 187 public repositories matching this topic...

Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.

  • Updated Jan 29, 2021
  • C++
brandon-b-miller
brandon-b-miller commented Jan 4, 2021

Is your feature request related to a problem? Please describe.
It might be useful to have a singular clean and performant way to check if all the columns of a dataframe are of the same dtype, such as a DataFrame property _is_homogeneous. This comes up in a lot of places, such as where we might want to dispatch to a cupy matrix implementation (Transpose, some row wise reductions I believe

andygrove
andygrove commented Jul 25, 2020

Update the TPCH example to support query 6:

select
	sum(l_extendedprice * l_discount) as revenue
from
	lineitem
where
	l_shipdate >= date ':1'
	and l_shipdate < date ':1' + interval '1' year
	and l_discount between :2 - 0.01 and :2 + 0.01
	and l_quantity < :3;
blazingsql
beckernick
beckernick commented Nov 18, 2020

I'd like to be able to use INITCAP to capitalize the first character of every "token" in a string. This is listed as a supported operation on string columns in the Calcite reference, but is currently not implemented.

from pyspark.sql import SparkSession
from blazingsql import BlazingContext
import pandas as pd
​
​
# spark = SparkSession.builder \
#     .master("local") \
#    
lr4d
lr4d commented Jan 19, 2021

Problem description

If CI fails at the moment, it could be because linting has gone wrong, test have failed or the release has failed.

IMO, it makes more sense to have a separate workflow/job for linting, and for release, isolated from testing. This way, it is easier for a contributor to spot what part of the job has failed.

Isolating linting from testing would also speed up the CI as w

Improve this page

Add a description, image, and links to the arrow topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the arrow topic, visit your repo's landing page and select "manage topics."

Learn more