On Aug 30, 2007, at 6:51 PM, Tom Russell wrote:
> Tom Russell wrote:
>> Charles Yeomans wrote:
>>
>>> On Aug 30, 2007, at 6:36 PM, Tom Russell wrote:
>>>
>>>
>>>
>>>> I need to parse some info from a web page but not sure how to regEx
>>>> it.
>>>>
>>>> Example would be:
>>>>
>>>> href="http://myworld.ebay.com/xxxxxxxxx/">
>>>>
>>>> But I only need the stuff in the quotes.
>>>>
>>>> I assume my search pattern would be something like:
>>>> rg.SearchPattern="href="http:(\D+)>"
>>>>
>>>> Would this be correct?
>>>>
>>>>
>>> What happened when you tried it?
>>>
>>> Charles Yeomans
>>> _______________________________________________
>>> Unsubscribe or switch delivery mode:
>>> <http://www.realsoftware.com/support/listmanager/>
>>>
>>> Search the archives:
>>> <http://support.realsoftware.com/listarchives/lists.html>
>>>
>>>
>>>
>>>
>> Im getting a syntax error. I think my quotes are screwy
>>
>> _______________________________________________
>> Unsubscribe or switch delivery mode:
>> <http://www.realsoftware.com/support/listmanager/>
>>
>> Search the archives:
>> <http://support.realsoftware.com/listarchives/lists.html>
>>
>>
>>
> Fixed the quote issue. Found out Im doing my regex before the content
> loaded into my socket.
>
> It would probably be better to load my content into a string array
> rather than use the string itself? When I do my regex it pulls the
> first
> one it finds and its not even the one I want.
>
> plus Im getting part of what I want: //www.ebay.com"
If you want to include quotes as part of the search string, use hex
-- \x22 is ".
For what you want, I'd suggest starting with the search pattern
\x22(.+)\x22
and refine as needed. match.SubexpressionString(1) should give the
part inside the quotes.
Charles
_______________________________________________
Unsubscribe or switch delivery mode:
<http://www.realsoftware.com/support/listmanager/>
Search the archives:
<http://support.realsoftware.com/listarchives/lists.html>
|