Home All Groups Group Topic Archive Search About
Author
19 Dec 2008 9:56 AM
nomad
Hi,

I need to be able to retrieve values from a string made up of HTML.  A
colleague has mentioned using regular expressions to retrieve the
value but this is proving quite difficult.  If someome could point me
in the right direction in regard to the values below, it would be
greatly appreciated.

<td class="brandorange">Quote reference: 123456789</td> - I need to
retrieve the 123456789 value.

<input type="radio" name="selections.excessBuildings" value="1">
£100<input type="radio" name="selections.excessBuildings" value="2">
£150<input type="radio" name="selections.excessBuildings" value="3"
checked="checked">£300<input type="radio"
name="selections.excessBuildings" value="4"> - I need to be able to
retrieve the checked value i.e. value '3' is checked so I need to
retrieve 300.

Author
19 Dec 2008 2:35 PM
Lav G
I wrote this method for you.
It works.

you have to pass the html string as parameter to this method

        string GetCheckedValue(string inputStringHtml)
        {
            string pattern = ".*checked=\"checked\"[^\\>]*\\>";
            string poundValue = "£0";
            Regex expression = new Regex(pattern);
            if(expression.IsMatch(inputStringHtml))
            {
                string[] split = expression.Split(inputStringHtml,2);
                expression = new Regex("\\<");
                if (expression.IsMatch(split[1]))
                {
                    string[] values = expression.Split(split[1]);
                    poundValue = values[0];
                }
            }
            return poundValue;
        }

Lav G
http://lavbox.blogspot.com
Are all your drivers up to date? click for free checkup

Author
20 Dec 2008 11:31 PM
Tim Roberts
nomad <d.bedg***@o2.co.uk> wrote:
>
>I need to be able to retrieve values from a string made up of HTML.  A
>colleague has mentioned using regular expressions to retrieve the
>value but this is proving quite difficult.  If someome could point me
>in the right direction in regard to the values below, it would be
>greatly appreciated.

In general regular expressions are a terrible way to parse HTML, because
most HTML is not very regular.  A simple regex that matches your current
example would fail if the site decides changes the HTML just a bit.

I suggest you investigate one of the available HTML parsers that can deal
with this variability:

http://www.theserverside.net/discussions/thread.tss?thread_id=36886
--
Tim Roberts, t***@probo.com
Providenza & Boekelheide, Inc.
Author
21 Dec 2008 1:13 AM
Rudy Velthuis
Tim Roberts wrote:

> nomad <d.bedg***@o2.co.uk> wrote:
> >
> > I need to be able to retrieve values from a string made up of HTML.
> > A colleague has mentioned using regular expressions to retrieve the
> > value but this is proving quite difficult.  If someome could point
> > me in the right direction in regard to the values below, it would be
> > greatly appreciated.
>
> In general regular expressions are a terrible way to parse HTML,
> because most HTML is not very regular.

<<
Some people, when confronted with a problem, think "I know, I'll
use regular expressions." Now they have two problems.
-- Jamie Zawinski
Show quoteHide quote
>>


--
Rudy Velthuis        http://rvelthuis.de

"Sanity is a madness put to good uses."
-- George Santayana (1863-1952)

Bookmark and Share