NUT: RegExp for finding opening HTML tag

I’m working with printing in flash and I need to span long articles over several pages. I have worked out a solution for finding the overflowing text for each page. But the body text in the articles are formatted with HTML and CSS, so I cannot just split the text anywhere.

Example:

bodyField.htmlText results look like this:

... <P ALIGN="LEFT"><FONT FACE="Times Roman">Morbi non lectus et purus bibendum dictum</FONT></P><P ALIGN="LEFT"><FONT FACE="Times Roman">Morbi non lectus et purus bibendum dictum</FONT></P>...

If now my overflow algorithm finds that this text needs to be split up at character index 52 (the character “n” in “Morbi non”), then that leaves me without a closing P-tag at the current page and an opening P-tag on the next.

I would like to be able to find the character index of the “<” character that opens the P-tag that the split point character is within. Basically I would like to match any previous instance of “<P” before the given character index.

Any ideas, you hard core reguar expression folks out there?