Welcome, guest | Sign In | My Account | Store | Cart

longest common substring (Python recipe) by yota
ActiveState Code (http://code.activestate.com/recipes/578465/)

Return, more than the substring itself, the position of the said substring, relative to each string passed in parameter.

String is a generic term. Here, it is an array, any object with __getitem__() method should work.

      #!/usr/bin/env python3.2

import numpy as np

def longest_common_substring(src, dst) :
	c = np.zeros((len(src), len(dst)), dtype=np.int)
	z = 0
	src_m = None
	dst_m = None
	for i in range(len(src)) :
		for j in range(len(dst)) :
			if src[i] == dst[j] :
				if i == 0 or j == 0 :
					c[i,j] = 1
				else :
					c[i, j] = c[i-1, j-1] + 1
				if c[i, j] > z :
					z = c[i, j]
				if c[i, j] == z :
					src_m = (i-z+1, i+1)
					dst_m = (j-z+1, j+1)
			else :
				c[i, j] = 0
	return src_m, dst_m
	
>>> a = """Lorem ipsum dolor sit amet consectetur adipiscing
	elit Ut id nisl quis lacus lobortis egestas id nec turpis""".split()
>>> b = """Lorem ipsum lobortis dolor sit adipiscing elit dolor
	amet consectetur Ut id nisl quis lacus egestas id nec turpis""".split()
>>> src_m, dst_m = longest_common_substring(a, b)
>>> print(src_m[0], src_m[1])
8 13
>>> print(a[src_m[0]:src_m[1]])
['Ut', 'id', 'nisl', 'quis', 'lacus']
>>> print(dst_m[0], dst_m[1])
10 15
>>> print(b[dst_m[0]:dst_m[1]])
['Ut', 'id', 'nisl', 'quis', 'lacus']

      

Tags: longest_common_substring

Created by yota on Tue, 19 Feb 2013 (GPL3)

◄	Python recipes (4591)	►
◄	yota's recipes (13)	►

Required Modules

numpy

Other Information and Tasks

Licensed under the GPL 3
Viewed 7452 times
Revision 4 (updated 11 years ago)

Accounts

Code Recipes

Feedback & Information

ActiveState

© 2024 ActiveState Software Inc. All rights reserved. ActiveState®, Komodo®, ActiveState Perl Dev Kit®, ActiveState Tcl Dev Kit®, ActivePerl®, ActivePython®, and ActiveTcl® are registered trademarks of ActiveState. All other marks are property of their respective owners.

longest common substring (Python recipe) by yota ActiveState Code (http://code.activestate.com/recipes/578465/)