Better than regex

Question

Better than regex

I made an application that can extract some specific information from a specific website. For this, I used a regular expression that gives me the desired result. Is there a more efficient process or idea than a regular expression for this simple seeker.

+3

java web-crawler

Toukir naim May 20 '12 at 17:01

source share

1 answer

Radu Simionescu · Answer 1 · 2012-05-20T17:24:01+0000

If you say that this is a simple regular expression that solves your problem, then no, there is no other more effective solution. When it comes to crawling, an alternative would be to load the entire html page in memory, in a DOM document, and search using XPath or even XQuery. But in fact, if the information is easily extracted using regular expressions, then don’t worry, especially if you are not familiar with XPath.

The power of XPath comes when you want to do complex searches. And it is more elegant than regular expression for this task (at least in w3c oppinion). But if you want a quick solution, you have already found it, and it is more efficient in terms of RAM.

Better than regex

More articles: