I am having a problem in splitting an HTML docuement stored in a text file
according to the <div> tag.
I can split the document into paragraphs using:
p = re.compile('[\n]') s2 = p.sub('',s)
where s is the text
but how to split according to <div (not <div> or </div>)???
I've tried to use:
p = re.compile('<div') s2 = p.sub('',s)
but it did not work.