Take the 2-minute tour ×
Stack Overflow is a question and answer site for professional and enthusiast programmers. It's 100% free, no registration required.

I have a csv like this:

Date, i, eat, chicken, you, fish, banana
2014-9-14, 1, 2, 1, 1, 1, 0
2014-10-15, 1, 1, 1, 0, 0, 0
2014-11-13, 0, 1, 0, 1, 0, 1

Forget about upper/lowercase and stemming because I will be morphological analyzing Japanese texts.

Ultimate Goal: http://imgur.com/uyTRQXR (I do not have enough reputation to post images.)

Note: Y-axis is the word counts. It doesn't have to be a dot, x, square and some random shapes, just dots/x with different colors will be fine.

I want to use ggplot instead of matplotlib if possible.

share|improve this question
    
How to decide eat on 2014-09-14 at a scale of 2 –  Wazzzy Sep 26 '14 at 5:32
    
@Wazzzy There are two 'eat' in the sentence of 'I eat* chicken, you eat* chicken', there has been some mistakes on the mData, will correct it now, sorry. –  Sayyora Sep 26 '14 at 5:36
    
Mistake: I eat chicken, you eat fish. –  Sayyora Sep 26 '14 at 5:54
    
So what's your question? Is it how to plot a date vs. frequency scatterplot with custom icons, or how to do word counts that output that .csv? If both, then please separate this into two separate questions. –  Adam Sep 26 '14 at 6:39
1  
I can't see the relevance of these numbers being word counts in a Japanese text to the question in its current form. I suggest removing irrelevant information. –  Lev Levitsky Sep 26 '14 at 7:32

1 Answer 1

You're probably looking for a form of scatter plot. The first example on the ggplot project page shows an how to configure one.

The data should already be in the form which you need for the plot. In your case, you probably should omit the stat_smooth() since the words aren't really correlated.

share|improve this answer

Your Answer

 
discard

By posting your answer, you agree to the privacy policy and terms of service.

Not the answer you're looking for? Browse other questions tagged or ask your own question.