Machine learning provides computer algorithms that automatically discover patterns in data and make intelligent decisions from them.
6
votes
1answer
67 views
How to optimize K-nearest neighbours in C# for large number of dimensions?
I'm implementing the K-nearest neighbours classification algorithm in C# for a training and testing set of about 20,000 samples and 25 dimensions.
There are only two classes, represented by '0' and ...
2
votes
0answers
72 views
implementation of a new algorithm for sklearn
In the python library sklearn is implemented the algorithm for SparsePCA.
I have written the code for a another version of this algorithm that is much faster in some situations. I have not enough ...
0
votes
0answers
241 views
Stochastic gradient descent squared loss
I have implemented stochastic gradient descent in matlab and I would like to compare my results with another source but the error I am getting is higher (I am using squared error). I am worried I am ...
0
votes
0answers
64 views
1
vote
1answer
55 views
Compute logistic regression on tweet objects
Is my approach good to naming variables and exception handling? I would like to make this code more robust and maintainable. I need advice on exception handling, var naming and comments.
...
5
votes
2answers
137 views
Defensive programming type-checking
I have issues with dynamically typed languages, and I tend to worry about type a lot.
Numpy has different behaviour depending on if something is a matrix or a plain ndarray, or a list. I didn't ...
7
votes
1answer
785 views
Clojure Neural Network
After reading this article about Neural Networks I was inspired to write my own implementation that allows for more than one hidden layer.
I am interested in how to make this code more idiomatic - ...
5
votes
1answer
415 views
Why does the LR on spark run so slowly?
Because the MLlib does not support the sparse input, I ran the following code, which supports the sparse input format, on spark clusters. The settings are:
5 nodes, each node with 8 cores (all the ...
7
votes
2answers
276 views
Python neural network: arbitrary number of hidden nodes
I'm trying to write a neural network that only requires the user to specify the dimensionality of the network. Concretely, the user might define a network like this:
...
4
votes
1answer
3k views
Alternative to Python's Naive Bayes Classifier for Twitter Sentiment Mining
I am doing sentiment analysis on tweets. I have code that I developed from following an online tutorial (found here) and adding in some parts myself, which looks like this:
...
3
votes
2answers
291 views
C++ and STL - Machine Learning Problem
I would like to get some general comments on style and use of STL in particular. This is some code I wrote to do machine learning classification (logistic regression). Any suggestions would be very ...