
Byte Frequency Analyzer (Python recipe) by Stephen Chappell
ActiveState Code (http://code.activestate.com/recipes/578361/)

When beginning to compress a file or studying it to break certain forms of encryption, it is sometimes helpful to know how often each byte value occurs in the file. This recipe is a simple frequency analysis tool that may be helpful toward that end: it prints the hexadecimal value and count of every byte that appears in the given file. It can also serve as a starting point for those interested in writing tools for such fields.

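import os
import sys

def main():
    try:
        # One counter per possible byte value (0x00 through 0xFF).
        table = [0] * 256
        # Read in 1 MiB chunks so large files are never held in memory at once.
        with open(sys.argv[1], 'rb') as data:
            buff = data.read(2 ** 20)
            while buff:
                # bytearray yields integers on both Python 2 and Python 3,
                # so no ord() call is needed.
                for c in bytearray(buff):
                    table[c] += 1
                buff = data.read(2 ** 20)
        # Report only the byte values that actually occur, as "hex = count".
        sys.stdout.write('\n'.join('%02X = %d' % (i, c) for i, c in enumerate(table) if c))
    except (IndexError, IOError, OSError):
        # Missing argument or unreadable file: show usage instead of a traceback.
        sys.stdout.write('Usage: %s <filename>' % os.path.basename(sys.argv[0]))


if __name__ == '__main__':
    main()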
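
The same counting idea can also be written as a reusable function. The following is only a sketch, assuming Python 3; the names `byte_frequencies` and `format_report` are hypothetical and not part of the original recipe. It uses `collections.Counter` in place of the 256-entry list, and adds a percentage column sorted by frequency, which may be convenient when looking for the most common bytes.

```python
from collections import Counter


def byte_frequencies(path, chunk_size=2 ** 20):
    """Return a Counter mapping each byte value (0-255) to its count in the file."""
    counts = Counter()
    with open(path, 'rb') as data:
        # iter() with a sentinel keeps calling read() until it returns b'' (end of file).
        for chunk in iter(lambda: data.read(chunk_size), b''):
            # Iterating a bytearray yields integers, which become the Counter keys.
            counts.update(bytearray(chunk))
    return counts


def format_report(counts):
    """Format counts as 'XX = count (pct%)' lines, most frequent byte first."""
    total = sum(counts.values())
    lines = []
    # Sort by descending count, breaking ties by ascending byte value.
    for value, count in sorted(counts.items(), key=lambda item: (-item[1], item[0])):
        lines.append('%02X = %d (%.2f%%)' % (value, count, 100.0 * count / total))
    return '\n'.join(lines)
```

Calling `print(format_report(byte_frequencies('some_file.bin')))` (with any readable file path) would list the byte values from most to least frequent. An empty file simply produces an empty report.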

      

Tags: demonstration

1 comment

beni hess 11 years, 4 months ago

Others might just call it histogram ;)

Created by Stephen Chappell on Wed, 5 Dec 2012 (MIT)


Licensed under the MIT License
Revision 1
