Manipulation of text by programs, scripts, etc.
4
votes
4 answers
61 views
Print line after nth occurrence of a match
I am looking to display line 4598 in the following file. Effectively, I want to display the line AFTER the nth occurrence of a match; in this case, the line after the 3rd occurrence of <Car>. ...
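One possible approach, sketched with awk: count the lines that contain the string and print the line that follows the nth one. The helper name `line_after_nth` and the sample input are made up for illustration; the pattern is treated as a fixed string, and a line containing it several times counts once.

```shell
# Hypothetical helper: print the line following the nth line that contains STRING.
# Usage: line_after_nth N STRING < file
line_after_nth() {
  awk -v n="$1" -v pat="$2" '
    found { print; exit }                         # the line right after the nth match
    index($0, pat) && ++count == n { found = 1 }  # count matching lines only
  '
}

# Made-up sample input, not the asker's file
printf '<Car>\nfirst\n<Car>\nsecond\n<Car>\nthird\n<Car>\nfourth\n' | line_after_nth 3 '<Car>'
```

Because awk exits as soon as the line is printed, the rest of a large file is not read.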
4
votes
5 answers
58 views
Reformat each two-line sequence into two columns
I would like to tab my printout:
1 2
3 4
5 6
7 8
to
1 2 5 6
3 4 7 8
How can I do this with a simple command line (awk or sed)?
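A minimal awk sketch for this layout, assuming the input is processed in blocks of four lines (lines 1 and 3 of each block are joined, then lines 2 and 4). That reading of the example, and the helper name `two_columns`, are assumptions; a trailing block with fewer than four lines is not printed in full.

```shell
# Hypothetical helper: within each block of four lines, place the second
# two-line sequence to the right of the first one.
two_columns() {
  awk '
    NR % 4 == 1 { a = $0; next }        # remember line 1 of the block
    NR % 4 == 2 { b = $0; next }        # remember line 2 of the block
    NR % 4 == 3 { print a, $0; next }   # line 1 joined with line 3
    NR % 4 == 0 { print b, $0 }         # line 2 joined with line 4
  '
}

printf '1 2\n3 4\n5 6\n7 8\n' | two_columns
```

The joined halves are separated by awk's default output separator, a single space.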
1
vote
3 answers
46 views
Count rows with specific integer in a column
I have 6 columns, each with multiple rows.
I want to count the number of rows which have the integer 4 or 5 in the fifth column.
A1 jhfj jdhfjkhd kdkfjjh 5 jhsdjkfh
A2 ujhf jhdfhsd dsfkks 4 ...
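A short awk sketch of the counting step, assuming whitespace-separated columns. The helper name `count_4_or_5` and the sample rows are invented for illustration and are not the asker's data.

```shell
# Hypothetical helper: count rows whose fifth column is 4 or 5.
count_4_or_5() {
  # n + 0 forces a numeric 0 to be printed when no row matches
  awk '$5 == 4 || $5 == 5 { n++ } END { print n + 0 }'
}

# Made-up rows with six columns each
printf 'A1 a b c 5 x\nA2 a b c 4 y\nA3 a b c 7 z\n' | count_4_or_5
```

The comparison is numeric, so a value written as 4.0 would also be counted.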
0
votes
4 answers
32 views
What are alternative tools for searching the full content of files?
I realized that most of the text-related tools I use (grep/egrep/awk/sed) are line-oriented.
I guess that is a simplified approach to searching files, but I was wondering if there is another set of ...
1
vote
1 answer
34 views
Insert a line after a block of text with sed
I have a text file: file.conf
### Option: LogFile
# Name of log file.
# If not set, syslog is used.
#
# Mandatory: no
# Default:
# LogFile=
### Option: LogFileSize
# Maximum size of ...
5
votes
1 answer
68 views
Counting multiple patterns in a single pass with grep?
I've written a grep loop to iteratively count DNA trinucleotides within a gzipped FASTA file containing DNA sequences, e.g.
declare -a tri=(AAA AAC AAG AAT CAA .. etc)
for i in "${tri[@]}"
do
...
-1
votes
1 answer
44 views
Importing an excerpt from a file into a script
I'm trying to import an excerpt from a file into a script for processing.
LIST=$(sed '1,/+++NETWORKLIST+++/d' < /path/to/file | sort -t ";" -k2,2r)
My problem is that this script is not deleting ...
2
votes
3 answers
50 views
Print column if value is between two other values
I am fairly new to data analysis and I have a problem with filtering some of my values.
I have data arranged in a file with four columns like this:
A 1 10 5
B 10 100 120
C ...
2
votes
4 answers
75 views
Create an ASCII art table from tabular data
Given perhaps comma- or tab-delimited input, I'd like to present a series of appropriately padded columns to stdout, so I can easily scan columnar information which would otherwise present rather ...
3
votes
4 answers
46 views
Replace all newlines with spaces except the last
How can I replace all newlines with spaces except the last newline?
I can replace all newlines with spaces using tr, but how can I do it with exceptions?
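One way this could be done is with paste rather than tr: in serial mode it joins the input lines with the given delimiter and ends the result with a single newline. The helper name `join_lines` is only illustrative.

```shell
# Hypothetical helper: join all input lines with single spaces,
# leaving exactly one newline at the end of the output.
join_lines() {
  paste -sd ' ' -
}

printf 'a\nb\nc\n' | join_lines
```

For a file, `join_lines < file` works the same way.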
2
votes
2 answers
28 views
Print all matches or replace all strings in a BIG file which is NOT line-organised (no line separators)
The 5 GB files I have are streams of data rows of the form:
{datarow1...},{datarow2...},...,{datarowN...}
so one could actually say that there are lines {}, and even line separators, but coming as a ...
1
vote
1 answer
62 views
grep to extract a substring from a huge string
I am extracting certain information from a website using the curl command. I need to extract a certain subsection of the string using the grep command. For example, using the curl command I am getting the HTML ...
2
votes
2 answers
41 views
Replacing a pattern with another between the first and second occurrence of a pattern in a file
I want the commas (,) replaced by the letter u between the first occurrence of y and the second occurrence of y. The sample input file is given below.
100y00893225900,008932259,06,2013,NOKIA Q ...
18
votes
3 answers
915 views
Unix command to check if any two lines in a file are the same?
Is there a Unix command that can check if any two lines in a file are the same?
For example, consider a file sentences.txt
This is sentence X
This is sentence Y
This is sentence Z
This is sentence X
...
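A common way to check this is to sort the lines so that identical ones become adjacent and then let uniq report the repeated ones. The sketch below wraps that in a helper named `dup_lines`, a name chosen here for illustration; empty output means no two lines are the same.

```shell
# Hypothetical helper: print each line that occurs more than once in the input.
dup_lines() {
  sort | uniq -d   # uniq -d prints one copy of each repeated line
}

printf 'This is sentence X\nThis is sentence Y\nThis is sentence Z\nThis is sentence X\n' | dup_lines
```

For a file such as sentences.txt, `dup_lines < sentences.txt` would list the repeated lines.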
-2
votes
3 answers
48 views
Deleting lines that match a particular identifier from another file
I have 2 files. File 1 has an identifier (e.g. D7MHBF:11:1449:1988), and every new entry starts with @. It has a few more fields, which are not important in our analysis.
File 2 consists of a column of ...