Skip to content
#

streaming-data

Here are 297 public repositories matching this topic...

benthos
heikkilamarko
heikkilamarko commented Jan 3, 2022

Under the hood, Benthos csv input uses the standard encoding/csv packages's csv.Reader struct.

The current implementation of csv input doesn't allow setting the LazyQuotes field.

We have a use case where we need to set the LazyQuotes field in order to make things work correctly.

enhancement inputs good first issue effort: lower
kafka-ui
agolosen
agolosen commented Apr 24, 2022

Describe the bug

After Restart failed Task in a connector with failed tasks the State is still FAILED but the Tasks Failed is 0 at the same time, even if switch the tab to Tasks and return in the connector.
In my case both indicators matched after return to Connectors overview (however it could be just coincidence and the state ne

type/bug good first issue scope/backend status/accepted
demitri
demitri commented Aug 19, 2021

Problem description

I am getting the following error when reading a file from an S3 bucket:

Invalid bucket name "xxxx:yyyy@bucket": Bucket name must match the regex "^[a-zA-Z0-9.\-_]{1,255}$" or be an ARN matching the regex "^arn:(aws).*:s3:[a-z\-0-9]+:[0-9]{12}:accesspoint[/:][a-zA-Z0-9\-]{1,63}$|^arn:(aws).*:s3-outposts:[a-z\-0-9]+:[0-9]{12}:outpost[/:][a-zA-Z0-9\-]{1,63}[/:]acce
cybertyche
cybertyche commented Dec 13, 2018

Implement progressive versions of hopping and tumbling windows:

  • Both window macro methods should get added versions that take an additional parameter
  • The parameter should represent the time interval that should be used to produce intermediate results of aggregations
  • The parameter should be a clean divisor of the tumble size for tumbling windows and the hop size for hopping windows
help wanted good first issue
wissousad
wissousad commented Aug 6, 2019

Hello, I have a CSV file that has 9 features and 9 expected targets, and I want to test 2 regression models on this data (that should be generated as a stream).

When I test the MultiTargetRegressionHoeffdingTree and RegressorChain on this data I get a bad R2-score, but when I tried normalizing my data with scikit-learn I get a pretty good R2-score. The problem is that I should not use sci

enhancement help wanted good first issue
memgraph
antonio2368
antonio2368 commented Jan 4, 2022

CASE doesn't work well with null. This works as expected and prints 'works':

WITH 2 AS name
RETURN CASE name
    WHEN 2 THEN 'works'
    WHEN null THEN "doesn't work"
    ELSE 'something went wrong'
END

If we swap the first case from 2 to 3. It should print 'something went wrong', but instead it prints "doesn't work":

WITH 2 AS name
RETURN CASE name
    WHEN 3 THEN 'works'
good first issue
cloudflow
michaelpnash
michaelpnash commented Apr 23, 2020

It is currently hard for users to track which versions of dependencies they are getting and which versions they should use when adding extra dependencies to their projects. This results in code like this in our own example projects:

libraryDependencies ++= Seq(
        "com.lightbend.akka"     %% "akka-stream-alpakka-file"  % "1.1.2",
        "com.typesafe.akka"      %% "akka-http-spray-js
good first issue priority/nice-to-have kind/maintenance Backlog
carlbrochu
carlbrochu commented Apr 18, 2019

Is your feature request related to a problem? Please describe.
Today the user needs to deploy udf jars and reference data csvs manually to the blob location

Describe the solution you'd like
Enable the user to choose a file on a local disk which the web portal will then upload to the right location

enhancement help wanted good first issue

Data stream analytics: Implement online learning methods to address concept drift in data streams using the River library. Code for the paper entitled "PWPAE: An Ensemble Framework for Concept Drift Adaptation in IoT Data Streams" accepted in IEEE GlobeCom 2021.

  • Updated Mar 24, 2022
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the streaming-data topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the streaming-data topic, visit your repo's landing page and select "manage topics."

Learn more