1
vote
4answers
150 views

text categorization classifiers

Does anybody know of good open-source text-categorization models? I know about Stanford Classifier, Weka, Mallet, etc. but all of them require training. I need to classify news articles into ...
0
votes
0answers
45 views

wordnet for Arabic language using eclipse [closed]

my project is to built an application which analyse sentiments in Arabic language and classify them into positive and negative , I read a lot of documents and I knew that sentiment should be analysed ...
0
votes
0answers
50 views

Classifier4J provide custom stopwords

I wan't to use the SimpleSummarizer from Classifier4J. I'm accessing the mothods via Jython but so far everything has worked just like it would in java. Now since my language is german, I need to ...
2
votes
2answers
111 views

How to predict a continuous value (time) from text documents? [closed]

I have about 3000 text documents which are related to a duration of time when the document was "interesting". So lets say document 1 has 300 lines of text with content, which led to a duration of ...
0
votes
0answers
62 views

Weka: Not Getting Output in Java API

I'm implementing a classified filter, that filters based on attributes present in the training data. //implement filter that standardizes via training set Standardize filter = new Standardize(); ...
0
votes
1answer
50 views

why doesn't liblinear predict the majority class?

Most machine learning classifiers, when encountering an instance with no features that it's seen before, would classify the example with the class that was most frequent in the training data. This ...
4
votes
2answers
89 views

Parameters of a Weka Classifier

I use Weka. I would like to access the parameters (the weights) of a classifier. What I want to do is to access the values of the parameters to determine how they have an influence on the attributes. ...
0
votes
1answer
182 views

libsvm c# classification example

I'm currently working on libsvm and I want to make a svm classifier on a data set, let me explain : for example I have a data file like that good 0.1 2.5 10 11 1.24 . bad 10 11.3 23.2 7.4 4.2 . ...
0
votes
1answer
58 views

how to add malicious features for classification in weka as a data set

I am doing a project on how to detect and classify malicious content using weka data mining tool. I have developed an algorithm but the problem is I don't know how and where to add malicious features ...
1
vote
1answer
71 views

Using Weka for classification

I'm new to Weka! My purpose is to classify an existing article to 2 categories: EDUCATION or ENTERTAINMENT. I already have 400 articles in EDUCATION category and 400 articles in ENTERTAINMENT. So ...
1
vote
2answers
338 views

Loding a weka model to a java code

I have saved the result of weka classification by right clicking on the model and selecting "save model". Now, I want to load it and work with in my "JAVA" application. How can I do that? Models could ...
0
votes
1answer
398 views

How to test a Weka Text Classification (FilteredClassifier)

Looked at lots of examples for this, and so far no luck. I'd like to classify free text. Configure a text classifier. (FilteredClassifier using StringToWordVector and LibSVM) Train the classifier ...
0
votes
1answer
42 views

which approach for readability classifier

I'm afraid my understanding of the theory behind classifiers is not deep, so please excuse me if my question looks naive to you. Goal: Given an arbitrary text, classify it according to age ranges, ...
4
votes
2answers
344 views

Handling missing attributes in Naive Bayes classifier

I am writing a Naive Bayes classifier for performing indoor room localization from WiFi signal strength. So far it is working well, but I have some questions about missing features. This occurs ...
0
votes
0answers
175 views

Weka java api classification issue

I am trying to build a text classifier based on weka SMO algo. I have created the following code based on diferent resources http://pastebin.com/vSek2gZ9 But it is not giving the actual result. I am ...
2
votes
1answer
97 views

How should i model the training set in my neural network?

I have a silly confusion but it is bothering me a lot. I have to make an ANN for spam detection. Till now I have developed modules for developing tfidf vectors of mails and calculating PCA of that ...
-1
votes
1answer
171 views

Image Classification into Good or Bad Images

We want to tell whether an image is good or bad. There are a fixed set of checks we do to classify an image into good or bad category. Example: 1. Background color. 2. Height X Width ratio. 3. No ...
4
votes
1answer
118 views

Bayesian classification for semi-structured data in Java

I would like to train and use a bayesian classifier for the following situation: Semi-structured data - basically an XML schema Information is contained in multiple plain text fields Some fields / ...
0
votes
1answer
285 views

How to improve performance of SMO classifier in weka?

I am using weka SMO classifier for classify the documents.There are many parameters for smo available like Kernal, tolerance etc.., I tested using different parameters but i not get good result large ...
0
votes
0answers
118 views

Update a Trained Weka Classifier with a New “Class” Instance

I implemented an IBK classifier and trained it as following (this is just for testing purpose): ArrayList<Attribute> atts = new ArrayList<Attribute>(); ArrayList<String> classVal = ...
2
votes
2answers
542 views

Create an unlabeled instance in Weka

I trained an IBK classifier with some training data that I created manually as following: ArrayList<Attribute> atts = new ArrayList<Attribute>(); ArrayList<String> classVal = new ...
3
votes
1answer
252 views

Training a weak learner

I'm implementing an application using AdaBoost to classify if an elephant is Asian or African elephant. My input data is: Elephant size: 235 Elephant weight: 3568 Sample weight: 0.1 Elephant type: ...
0
votes
0answers
106 views

Turning Weka classifier into natural language - Image Analysis and using Classifiers [closed]

I have got a set of models from Weka, which I have been able to read directly into my code, as opposed to using Weka. As a result I now need to turn the rules, into readable and understandable English ...
-3
votes
1answer
85 views

calculating error rate of a system comming up with percentage answers

I would appreciate ideas in this regard. Imagine I have a software (constraint satisfaction solving) which solves a problem and comes up with the answers like this: 100 % A is the solution, 100 % B ...
0
votes
1answer
741 views

Creating training and test set in weka using StratifiedRemoveFolds example

I am trying to run some classification using weka from the command line. In this link weka-Primer-commandline there is the following example for creating test and training sets: java ...
-1
votes
1answer
90 views

What's the best classification method for capturing packets in heavy traffic

I want to capture packets from a crowded network and classify it by user request. I use java and jpcap for capturing packets. now I'm using array threads for each req packet. it's realy slow and there ...
0
votes
2answers
89 views

Finding the number of times each word in a hashset occurs in text document

I'm implementing a Naive Bayes text classification algorithm in Java. What I have done so far is, declare a hashset called Vocabulary which stores all the unique words from a given text file (test ...
-1
votes
1answer
166 views

How to assign a class to an instance using saved model in weka

Before I decided to post the problem I'm having I've read a lot of posts and I still couldn't get a clear answer. So here it is: Using weka I have trained a NaiveBayesTree with my training data that ...
0
votes
0answers
109 views

Getting NotSerializableException with Weka.classifiers.multilabel.CC

Program Trace import weka.classifiers.multilabel.*; //from meka.jar CC cc = new CC(); // weka.classifiers.multilabel.CC //Classification Chains Method cc.buildClassifier(TrainingSet); Evaluation ...
3
votes
1answer
2k views

Lucene: exception - Query parser encountered <EOF> after “some word”

I am working on a classification problem to classify product reviews as positive, negative or neutral as per the training data using Lucene API. I am using an ArrayList of Review objects - ...

1 2 3
15 30 50 per page