Welcome, guest | Sign In | My Account | Store | Cart

How to read millions of hexadecimal numbers into a numpy array quickly (Python recipe) by Oren Tirosh
ActiveState Code (http://code.activestate.com/recipes/578177/)

The numpy.fromfile() function supports binary formats or decimal text. How do you read millions of hexadecimal numbers quickly?

      data = numpy.frombuffer(open(filename).read().replace('\n','').decode('hex'), dtype=numpy.uint32).byteswap()

# Slow version, for reference:
numpy.fromiter( (int(x, 16) for x in open(filename)), dtype=numpy.uint32)

Reading the numbers one by one and converting them with int(s, 16) is quite slow. This trick speeds it up by about a factor of 4 and avoid constructing millions of individual python int objects.

Note that this method does not verify the format. It assumes that the input consists of numbers with a fixed width of exactly 8 chars and contains nothing but hexadecimal digits and newlines.

Tags: numpy

Created by Oren Tirosh on Wed, 27 Jun 2012 (MIT)

◄	Python recipes (4591)	►
◄	Oren Tirosh's recipes (16)	►

Required Modules

numpy

Other Information and Tasks

Licensed under the MIT License
Viewed 16946 times
Revision 1

Accounts

Code Recipes

Feedback & Information

ActiveState

© 2024 ActiveState Software Inc. All rights reserved. ActiveState®, Komodo®, ActiveState Perl Dev Kit®, ActiveState Tcl Dev Kit®, ActivePerl®, ActivePython®, and ActiveTcl® are registered trademarks of ActiveState. All other marks are property of their respective owners.

How to read millions of hexadecimal numbers into a numpy array quickly (Python recipe) by Oren Tirosh ActiveState Code (http://code.activestate.com/recipes/578177/)