I'm trying to get some movie information from IMDb.com (I actually got connected to the damn page
) but it seems that my regex skills are a bit rusty ATM.
So, the problem is that there's data like:
Code: Select all
<b class="blackcatheader">Directed by</b><br>
<a href="/name/nm0905152/">Andy Wachowski</a><br><a href="/name/nm0905154/">Larry Wachowski</a>
and I'm trying to get all directors from that data. So far I have the following regex:
Code: Select all
(?:<b\s+[^>]*>Directed by</b><br>\s?\n){1}(?:<a\s+[^>]*>([^<]*)</a>(?:<br>)?)*
Which
almost works. The problem with the above regex is that it captures all directors to the same group (
screenshot1.png) even though I need them to go to their own groups. Any suggestions?
I'm using
boost::regex_search.