Tagged Questions
2
votes
0answers
43 views
How should I distribute a classifier to customers?
When consulting, I often do my exploratory analysis and prototyping in R, and deliver results on the initial dataset to the client. The client wants to use the trained classifier in a production ...
0
votes
0answers
35 views
Weighting and time series with machine learning
I'm trying to produce a model to predict the price of a product on the basis of several factors effecting previous time-stamped sales. I am certain that older sales are less relevant to the prediction ...
2
votes
0answers
60 views
detecting circadian rhythm in a time series
I have a sensor that can detect minute changes in distance. It produces a time series.
I would like to point it at people and detect things like their sleeping pattern. How would one build a system ...
2
votes
1answer
146 views
Simple text classifier: classification taking forever?
I work for a small tech startup, and I want to classify or users into demographics based on the domain of their email address. When users sign up to our site, they can enter a job category, or pick ...
2
votes
1answer
106 views
Simple machine learning: bot detection
I've been aching to get my feet wet with a machine learning project, and I've found one that should be relatively simple, and actually has non-negligible business value for my organization. The ...
5
votes
2answers
536 views
Pandas / Statsmodel / Scikits-learn
Are Pandas, Statsmodels and scikits-learn different implementations of machine learning/statistical operations, or are these complementary to one another?
Which of these has the most comprehensive ...
0
votes
0answers
66 views
Mixing categorial and continuous data in Naive Bayes classifier using scikit-learn
I'm using scikit-learn in Python to develop a classification algorithm to predict gender of a certain customers. Amongst others I want to use the Naive Bayes classifier but my problem is that I have a ...
6
votes
1answer
1k views
R vs Python for Data Analysis [duplicate]
Possible Duplicate:
Python as a statistics workbench
I am just starting out with data analysis and machine learning. From the books that I am reading/have read Python and R seem to be the ...
3
votes
1answer
100 views
Practical applications of affinity propagation
I am learning about machine learning here.
They took a set of prices for specific companies on the stock market and graphed them:
I would like to know what are some practical applications of ...
2
votes
2answers
309 views
Implementation of CRF in python
Is there a popular implementation of Conditional Random Fields in Python?
I can't seem to find any that is widely used and popular!
0
votes
0answers
150 views
Best platform for running (python and ( R or Octave)) algorithms for (large/big) data analytics [closed]
I have a machine learning algorithm currently implemented in R, wrapped in python (rpy2).
I would like to deploy this inside a web application and I am looking for the right platform to do this, ...
4
votes
3answers
524 views
Resources for learning how to implement ensemble methods
I understand theoretically (sort of) how they would work, but am not sure how to go about actually making use an ensemble method (such as voting, weighted mixtures, etc.).
What are good resources ...
8
votes
3answers
183 views
What is the most efficient way of training data using least memory?
This is my training data: 200,000 Examples x 10,000 Features.
So my training data matrix is - 200,000 x 10,000.
I managed to save this in a flat file without having memory issues by saving every ...
0
votes
2answers
237 views
Collinear variables in Multiclass LDA training
I'm training a Multi-class LDA classifier with 8 classes of data.
While performing training, I get a warning of: "Variables are collinear"
I'm getting a training accuracy of over 90%.
I'm using ...
0
votes
0answers
198 views
Multi Class vs 2 class Naive Bayes
I was wondering what are the implications of using a multi-class Naive Bayes versus a 2 class Naive Bayes (for one against everything).
Which technique performs better?
I've previously came across ...