Charles Yeomans wrote:
> On Aug 30, 2007, at 6:51 PM, Tom Russell wrote:
>
>
>> Tom Russell wrote:
>>
>>> Charles Yeomans wrote:
>>>
>>>
>>>> On Aug 30, 2007, at 6:36 PM, Tom Russell wrote:
>>>>
>>>>
>>>>
>>>>
>>>>> I need to parse some info from a web page but not sure how to regEx
>>>>> it.
>>>>>
>>>>> Example would be:
>>>>>
>>>>> href="http://myworld.ebay.com/xxxxxxxxx/">
>>>>>
>>>>> But I only need the stuff in the quotes.
>>>>>
>>>>> I assume my search pattern would be something like:
>>>>> rg.SearchPattern="href="http:(\D+)>"
>>>>>
>>>>> Would this be correct?
>>>>>
>>>>>
>>>>>
>>>> What happened when you tried it?
>>>>
>>>> Charles Yeomans
>>>> _______________________________________________
>>>> Unsubscribe or switch delivery mode:
>>>> <http://www.realsoftware.com/support/listmanager/>
>>>>
>>>> Search the archives:
>>>> <http://support.realsoftware.com/listarchives/lists.html>
>>>>
>>>>
>>>>
>>>>
>>>>
>>> Im getting a syntax error. I think my quotes are screwy
>>>
>>> _______________________________________________
>>> Unsubscribe or switch delivery mode:
>>> <http://www.realsoftware.com/support/listmanager/>
>>>
>>> Search the archives:
>>> <http://support.realsoftware.com/listarchives/lists.html>
>>>
>>>
>>>
>>>
>> Fixed the quote issue. Found out Im doing my regex before the content
>> loaded into my socket.
>>
>> It would probably be better to load my content into a string array
>> rather than use the string itself? When I do my regex it pulls the
>> first
>> one it finds and its not even the one I want.
>>
>> plus Im getting part of what I want: //www.ebay.com"
>>
>
> If you want to include quotes as part of the search string, use hex
> -- \x22 is ".
>
> For what you want, I'd suggest starting with the search pattern
>
> \x22(.+)\x22
>
> and refine as needed. match.SubexpressionString(1) should give the
> part inside the quotes.
>
> Charles
> _______________________________________________
> Unsubscribe or switch delivery mode:
> <http://www.realsoftware.com/support/listmanager/>
>
> Search the archives:
> <http://support.realsoftware.com/listarchives/lists.html>
>
>
>
I am not sure what you mean here. Instead of using quotes, use this? And
wrap my stuff in that?
\x22(.+)href="http:(\D+)>\x22
_______________________________________________
Unsubscribe or switch delivery mode:
<http://www.realsoftware.com/support/listmanager/>
Search the archives:
<http://support.realsoftware.com/listarchives/lists.html>
|