I work on a news website that stores all their stories as XML. I know, not the best way to go, but this is what it is. What I'm trying to do allows me to search for XML files from a website. Right now, our search function - everything works on Google (it searches only for what Google has already crawled).
What I think right off the bat is to use Grep, which works well, but probably won't scale too much. Another option, which will require much more work, but will work better, is to store parts of XML in a relational database.
Considering how our backend is configured, switching to another storage model will take a lot of time, so for now we have to work. Ideas?
Ray z source
share