On 2007-08-31, at 01:42, Tom Russell wrote:
> That one makes sense, a nightmare with the quotes though!
???
What nightmare?
href=["]+...+["]
In this picture you see the RegEx in work ( the red color )
http://www.seedit.info/rb/hrefregex.png
If you want remove the quote and href=, use Replaceall .. ??
Sven E
>
> Sven E Olsson wrote:
>> Links could looks like this:
>> <a class="blue" href="http://www.seedit.info">
>> <a href="http://www.seedit.info" title="Visit seedit.info">
>>
>> There is also examples with Eventhandlers, so you could not search to
>> the last ">"
>>
>>
>> The only that is safe looks to be: href="http://www.seedit.info" and
>> "href=" must be included, because the rest could just be text like:
>> <a href="http://www.seedit.info">www.seedit.info</a>
>>
>> This return: href="http://www.seedit.info" - and then remove "href="
>> href=["]+...+["]
>>
>> ---------
>> Sven E
>>
>> On 2007-08-31, at 00:36, Tom Russell wrote:
>>
>>
>>> I need to parse some info from a web page but not sure how to regEx
>>> it.
>>>
>>> Example would be:
>>>
>>> href="http://myworld.ebay.com/xxxxxxxxx/">
>>>
>>> But I only need the stuff in the quotes.
>>>
>>> I assume my search pattern would be something like:
>>> rg.SearchPattern="href="http:(\D+)>"
>>>
>>> Would this be correct?
>>>
>>> Thanks,
>>>
>>> Tom
>>>
>>>
>>>
>>
_______________________________________________
Unsubscribe or switch delivery mode:
<http://www.realsoftware.com/support/listmanager/>
Search the archives:
<http://support.realsoftware.com/listarchives/lists.html>
|