Skip to content
#

streaming-data

Here are 299 public repositories matching this topic...

demitri
demitri commented Aug 19, 2021

Problem description

I am getting the following error when reading a file from an S3 bucket:

Invalid bucket name "xxxx:yyyy@bucket": Bucket name must match the regex "^[a-zA-Z0-9.\-_]{1,255}$" or be an ARN matching the regex "^arn:(aws).*:s3:[a-z\-0-9]+:[0-9]{12}:accesspoint[/:][a-zA-Z0-9\-]{1,63}$|^arn:(aws).*:s3-outposts:[a-z\-0-9]+:[0-9]{12}:outpost[/:][a-zA-Z0-9\-]{1,63}[/:]acce
cybertyche
cybertyche commented Dec 13, 2018

Implement progressive versions of hopping and tumbling windows:

  • Both window macro methods should get added versions that take an additional parameter
  • The parameter should represent the time interval that should be used to produce intermediate results of aggregations
  • The parameter should be a clean divisor of the tumble size for tumbling windows and the hop size for hopping windows
help wanted good first issue
wissousad
wissousad commented Aug 6, 2019

Hello, I have a CSV file that has 9 features and 9 expected targets, and I want to test 2 regression models on this data (that should be generated as a stream).

When I test the MultiTargetRegressionHoeffdingTree and RegressorChain on this data I get a bad R2-score, but when I tried normalizing my data with scikit-learn I get a pretty good R2-score. The problem is that I should not use sci

enhancement help wanted good first issue
memgraph
antonio2368
antonio2368 commented Jan 4, 2022

CASE doesn't work well with null. This works as expected and prints 'works':

WITH 2 AS name
RETURN CASE name
    WHEN 2 THEN 'works'
    WHEN null THEN "doesn't work"
    ELSE 'something went wrong'
END

If we swap the first case from 2 to 3. It should print 'something went wrong', but instead it prints "doesn't work":

WITH 2 AS name
RETURN CASE name
    WHEN 3 THEN 'works'
good first issue
cloudflow
michaelpnash
michaelpnash commented Apr 23, 2020

It is currently hard for users to track which versions of dependencies they are getting and which versions they should use when adding extra dependencies to their projects. This results in code like this in our own example projects:

libraryDependencies ++= Seq(
        "com.lightbend.akka"     %% "akka-stream-alpakka-file"  % "1.1.2",
        "com.typesafe.akka"      %% "akka-http-spray-js
good first issue priority/nice-to-have kind/maintenance Backlog
carlbrochu
carlbrochu commented Apr 18, 2019

Is your feature request related to a problem? Please describe.
Today the user needs to deploy udf jars and reference data csvs manually to the blob location

Describe the solution you'd like
Enable the user to choose a file on a local disk which the web portal will then upload to the right location

enhancement help wanted good first issue

Data stream analytics: Implement online learning methods to address concept drift in data streams using the River library. Code for the paper entitled "PWPAE: An Ensemble Framework for Concept Drift Adaptation in IoT Data Streams" accepted in IEEE GlobeCom 2021.

  • Updated Mar 24, 2022
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the streaming-data topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the streaming-data topic, visit your repo's landing page and select "manage topics."

Learn more