How To Compete With Google With Low Economic Resources


The content a search engine has indexed is very important for offering users a quality search service. A few months ago, I started to think that unless I got funding, I would not be able to compete with the big players because of the lack of infrastructure and so on. The funding fell through, but it seems that is not so critical after all.

Current Data

My startup is a new search engine. I now have about 6 million web pages in Spanish. That is very little content compared with Google and Bing, to name a few. But in the tests I ran, I have on average 50% of the content Google shows on its first page for popular queries. I think this is not bad, considering that so far I have no modules that fetch only authoritative or popular content.
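A coverage figure like the 50% above could be computed roughly as follows. This is only a minimal sketch, not the actual B+ code: the function names, the URL normalization rule, and the idea of keeping the index as a plain set of URLs are all assumptions made for illustration.

```python
from urllib.parse import urlsplit


def normalize(url: str) -> str:
    """Reduce a URL to host + path so trivial variants compare equal.

    Assumption: scheme, a leading "www." and a trailing slash do not matter.
    """
    parts = urlsplit(url.strip())
    host = parts.netloc.lower()
    if host.startswith("www."):
        host = host[4:]
    return host + parts.path.rstrip("/")


def first_page_coverage(reference_urls, indexed_urls) -> float:
    """Fraction of the reference first-page URLs that are also in our index."""
    reference = {normalize(u) for u in reference_urls}
    if not reference:
        return 0.0
    indexed = {normalize(u) for u in indexed_urls}
    return len(reference & indexed) / len(reference)


def average_coverage(first_pages_by_query, indexed_urls) -> float:
    """Average the per-query coverage over a set of popular queries.

    first_pages_by_query maps a query string to the list of URLs the
    reference engine showed on its first page (collected by hand here).
    """
    if not first_pages_by_query:
        return 0.0
    scores = [
        first_page_coverage(urls, indexed_urls)
        for urls in first_pages_by_query.values()
    ]
    return sum(scores) / len(scores)
```

With a hand-collected dictionary of popular queries and their first-page URLs, `average_coverage` gives one number that can be tracked as the index grows.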

Fetching Popular Content Module

I am testing a module that builds a database of all the outbound links from those 6 million web pages (which are reasonably relevant). The database is being built now, and I will have the results in a few days. I expect to get about 10 million links. From then on, B+ will only index content from the pages those links point to, which should lead to a higher coverage percentage. Anything above 70% will be good, and figures like 90% would be great. This improvement will lead to better search results for users.
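The link-collecting step could look something like the sketch below. It is an illustration under stated assumptions, not the real module: the names `LinkCollector`, `outbound_links` and `build_link_db` are hypothetical, "outbound" is taken to mean a link to a different host, and counting how many pages point to each link is an extra guess at how popularity might be estimated.

```python
from collections import Counter
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlsplit


class LinkCollector(HTMLParser):
    """Collect the raw href values of all <a> tags in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def outbound_links(page_url: str, html: str) -> set:
    """Return the absolute http(s) links that leave the page's own host."""
    collector = LinkCollector()
    collector.feed(html)
    page_host = urlsplit(page_url).netloc.lower()
    found = set()
    for href in collector.hrefs:
        # Resolve relative links and drop the #fragment part.
        absolute, _fragment = urldefrag(urljoin(page_url, href))
        parts = urlsplit(absolute)
        if parts.scheme in ("http", "https") and parts.netloc.lower() != page_host:
            found.add(absolute)
    return found


def build_link_db(pages) -> Counter:
    """Map each outbound link to the number of crawled pages linking to it.

    pages is an iterable of (url, html) pairs taken from the existing index.
    """
    counts = Counter()
    for url, html in pages:
        counts.update(outbound_links(url, html))
    return counts
```

For millions of pages the in-memory `Counter` would have to be replaced by an on-disk store, but the per-page logic stays the same.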

Why Is Google's First Page Important?

I am not focused on returning the same results as Google's first page, but this measure is important for telling how much relevant content B+ has in Spanish. The personalization technology and the preferences fed into B+ will allow good search quality for communities and groups of people. So the first-page analysis gives me a pretty good estimate that I have most of the quality content, and that the lack of funding is not ruining the evolution of this search project.

I will publish the results on the blog soon.
