Lately, I've been reading up on the Common Language Infrastructure's data formats. One thing that came up is that some of the data is stored in a bit vector.
I hadn't ever used a bit vector before, so I did some reading. A bit vector is essentially an array of bits, which can greatly reduce the space needed to store something, especially when the individual elements require only a few bits each to represent.
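To make the idea concrete, here is a minimal sketch of the packing trick (this is not the posted `BitVector.cs`; the class and member names here are my own illustrative choices). Sixty-four boolean flags share one `ulong`, so an indexer just needs a word index and a bit index:

```csharp
using System;

class SimpleBitVector
{
    private readonly ulong[] words;

    public SimpleBitVector(int bitCount)
    {
        // Each 64-bit word holds 64 flags; round up to cover the last partial word.
        words = new ulong[(bitCount + 63) / 64];
    }

    public bool this[int index]
    {
        get => ((words[index / 64] >> (index % 64)) & 1UL) != 0;
        set
        {
            if (value)
                words[index / 64] |= 1UL << (index % 64);
            else
                words[index / 64] &= ~(1UL << (index % 64));
        }
    }
}
```

Storing 1,000 flags this way takes sixteen `ulong`s (128 bytes) instead of 1,000 `bool`s.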
After some banging on the keyboard (and reading), I had the following code:
BitVector.cs (As html)
Note to moderators: I had to post it to my own FTP account because it was too large for the submission; it exceeds the 30,000-character limit and is somewhere around 51,000 characters.
The question I have for you ladies and gents is: what kind of performance can I expect out of this code? I've previously written something similar for working with Unicode character sets, creating unions, intersections, exclusive disjunctions, and so on for non-deterministic and deterministic state machines (which use regex-like patterns that require large bit fields to cover the Unicode character set, et cetera). In that earlier work the focus was on the smallest possible footprint, which yielded an offset plus the smallest set of data possible.
The goal this time is to construct a bit field capable of reading existing (non-reduced) data, and of arbitrarily reading integers of varying sizes out of that data.
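The core of that operation can be sketched as follows. This is not taken from the posted code; `ReadBits` is an illustrative name, and the little-endian bit order within each byte is an assumption on my part:

```csharp
using System;

static class BitReader
{
    // Read bitCount bits starting at bitOffset from raw byte data,
    // returning them as an unsigned integer.
    public static ulong ReadBits(byte[] data, int bitOffset, int bitCount)
    {
        if (bitCount < 1 || bitCount > 64)
            throw new ArgumentOutOfRangeException(nameof(bitCount));

        ulong result = 0;
        for (int i = 0; i < bitCount; i++)
        {
            int bit = bitOffset + i;
            // Assumes little-endian bit order: bit 0 of each byte is the
            // least significant. A real implementation would read whole
            // words and mask/shift instead of looping bit by bit.
            if (((data[bit / 8] >> (bit % 8)) & 1) != 0)
                result |= 1UL << i;
        }
        return result;
    }
}
```

A word-at-a-time version with masking would be considerably faster, but the per-bit loop makes the addressing scheme easy to verify.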
The `TrueCount()` method uses a lookup table with a single byte for each possible value of a `ushort` (16-bit integer). The table itself is stored in a small file (873 bytes' worth) within the Resources of the program. If anyone wants the file I can post it, but it's pretty simple to calculate.
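For anyone who'd rather compute the table than load it, here is a rough sketch of a `TrueCount`-style lookup. It builds the table in memory at startup rather than reading it from a resource file as the original does, and `PopCount16`/`BuildTable` are names I've made up for illustration:

```csharp
static class BitCounting
{
    // One byte per possible ushort value, holding that value's set-bit count.
    static readonly byte[] PopCount16 = BuildTable();

    static byte[] BuildTable()
    {
        var table = new byte[65536];
        // Classic recurrence: popcount(i) = popcount(i >> 1) + (i & 1).
        for (int i = 1; i < 65536; i++)
            table[i] = (byte)(table[i >> 1] + (i & 1));
        return table;
    }

    // Count the set bits across an array of 16-bit words with one
    // table lookup per word.
    public static int TrueCount(ushort[] words)
    {
        int total = 0;
        foreach (ushort w in words)
            total += PopCount16[w];
        return total;
    }
}
```

The trade-off is a 64 KB table in exchange for one array read per 16 bits counted, instead of looping over individual bits.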
`BitArray`, `BitVector32` or `BinaryReader`? – svick Mar 14 '12 at 12:06