Modify and import as numpy.ndarray a txt file in python

Question

To import the data contained into the file my_file.txt that have the form:

Label[0] = 0.980252
Label[1] = -nan
Label[2] = -nan
Label[3] = -nan
Label[4] = 0.664706
Label[5] = -nan
Label[6] = -nan
Label[7] = -nan
Label[8] = -nan
Label[9] = -nan
Label[10] = -nan
Label[11] = 0.800183
Label[12] = -nan
Label[13] = -nan
Label[14] = -nan
Label[15] = 0
Mean Data = 15

I wrote the following code:

import numpy as np

with open('myfile.txt', 'r') as file_txt_original:
    data = file_txt_original.read()

    data = data.replace('Mean data', '-1')
    data = data.replace('Label[', '')
    data = data.replace(']', '')
    data = data.replace(' = ', ', ')

    file_txt_original.close()

with open('new_file.txt', 'w') as file_txt_copy:

    file_txt_copy.write(data)
    file_txt_copy.close()

my_array = np.loadtxt('new_file.txt', delimiter=',')

It works but this to me seems still quite an tricky solution... Any suggestion to improve this code without doing so many replacement or without saving an additional structure?

ferada · Accepted Answer · 2016-03-09 12:27:17Z

I don't quite get why the data is written out into a new file again; it would be more "typical" to parse each line and create the array simultaneously.

That said, apart from that concern the only other thing I'd like to point out is that the close calls on the file objects aren't necessary because you (absolutely correctly) already put them in a with block, so that the close method will be automatically called if the block is exited.

Edit:

Okay, so for clarification, I mean something like the following:

import re

import numpy as np

with open('myfile.txt', 'r') as file_txt_original:
    my_array = np.array([])

    for line in file_txt_original:
        matches = re.match("Label\[(\d+)] = (.*)", line)
        if matches:
            index, value = matches.groups()
            index = int(index)
            if index >= my_array.size:
                my_array.resize(index + 1)
            my_array[index] = float(value)

Obviously it would be much better to the size of the array from the start, or maybe collecting things into a list and only allocate the array at the end.

Yes, thanks, I tried to follow your suggestion, but I can not find any function to load the string 'data' into a numpy array without saving it before in txt and after having performed the modifications. Maybe I am missing something easy... — SeF, Mar 9 at 10:43

A. Romeu · Answer 2 · 2016-03-08 16:50:36Z

up vote 1 down vote

You can concatenate the replace strings after you open the file, it will give better visibility

data = file_txt_original.read()
.replace('Mean data', '-1')
.replace('Label[', '')
.replace(']', '')
.replace(' = ', ', ')

answered Mar 8 at 16:50

A. Romeu

1548

add a comment |

asked	7 months ago
viewed	39 times
active	7 months ago

current community

your communities

more stack exchange communities

Modify and import as numpy.ndarray a txt file in python

2 Answers 2

Your Answer

Not the answer you're looking for? Browse other questions tagged python data-importer or ask your own question.

Hot Network Questions

current community

your communities

more stack exchange communities

Modify and import as numpy.ndarray a txt file in python

2 Answers 2

Your Answer

Sign up or log in

Post as a guest

Not the answer you're looking for? Browse other questions tagged python data-importer or ask your own question.

Related

Hot Network Questions