Context:
- The following function is used as a JVM wrapper for a Python function, which is executed in a SQL-like environment. If you're interested in example usage you can check "Cumulate arrays from earlier rows (PySpark dataframe)" on Stack Overflow, but it shouldn't be necessary.
- Objects passed between the two runtimes are converted using Pyrolite and / or Py4J, and exactly matching the types expected by the JVM counterpart is crucial.
- There are two contexts of execution, with the internal `flatten_distinct_` being executed in a remote interpreter which is not directly accessible.
- Docstrings have been omitted intentionally.
```python
from typing import List, Union, Hashable as Hble

from toolz import unique, concat, compose

import pyspark.sql.functions as f
from pyspark.sql import Column
from pyspark.sql.types import ArrayType, StringType, DataType as DT


def flatten_distinct(col: Union[Column, str], dt: DT = StringType()) -> Column:
    assert isinstance(col, (Column, str)), (
        "col should be Column or str got {}".format(type(col)))

    def flatten_distinct_(xss: Union[List[List[Hble]], None]) -> List[Hble]:
        return compose(list, unique, concat)(xss or [])

    return f.udf(flatten_distinct_, ArrayType(dt))(col)
```
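For reviewers without a Spark cluster at hand, here is a stdlib-only sketch of what the inner `flatten_distinct_` computes. It is an assumption-laden stand-in: `chain.from_iterable` plays the role of `toolz.concat`, `dict.fromkeys` plays the role of the order-preserving `toolz.unique` (for hashable items), and the `flatten_distinct_sketch` name is mine, not part of the original code:

```python
from itertools import chain
from typing import Hashable, List, Optional


def flatten_distinct_sketch(xss: Optional[List[List[Hashable]]]) -> List[Hashable]:
    # chain.from_iterable ~ toolz.concat; dict.fromkeys preserves
    # first-seen order like toolz.unique; list(...) materializes the
    # result, mirroring the outermost step of compose(list, unique, concat).
    return list(dict.fromkeys(chain.from_iterable(xss or [])))


print(flatten_distinct_sketch([[1, 2], [2, 3], [3, 1]]))  # [1, 2, 3]
print(flatten_distinct_sketch(None))                      # []
```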
My main concern is whether this code is self-explanatory and readable for a Python user, with focus on:
- Usage of the type annotations.
- Function composition (would `pipe(xss or [], concat, unique, list)` be a better choice?).
- Immediate call of the anonymous functions.
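On the compose-vs-pipe question: both spell the same pipeline, differing only in reading order, with `pipe` listing the steps in the order the data flows. A minimal stdlib sketch to illustrate the ordering difference (the `pipe` and `unique` helpers below are my own reduced stand-ins for toolz's, not toolz itself):

```python
from functools import reduce
from itertools import chain


def pipe(value, *funcs):
    # Thread value left-to-right through funcs, like toolz.pipe.
    return reduce(lambda acc, fn: fn(acc), funcs, value)


def unique(seq):
    # First-seen order for hashable items, like toolz.unique.
    return iter(dict.fromkeys(seq))


xss = [[1, 2], [2, 3], [3, 1]]
# compose(list, unique, concat)(xss) reads right-to-left;
# pipe lists the same steps in execution order:
print(pipe(xss, chain.from_iterable, unique, list))  # [1, 2, 3]
```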