Monthly Archives: November 2015

Degree thesis subjects

I recently saw an event at a university in Romania (Universitatea Politehnica Bucuresti) that aims to help students choose the subject of their degree thesis. At this event, companies are invited to present themes in front of students. Below you will find a short list of themes related to our industry:

1. Automatic website classification

Possible categories: e-commerce, company website, news/blog, other.

2. Detecting website structure (and representing it as a tree)

E.g. the first level of an online store contains the main categories, the second level the subcategories, and the n-th level the product pages. The entire website can be represented as a tree.

3. Logo detection on the internet

When detecting logos on a web page, multiple issues can occur. For example: many logos in the same image, or scaled logos.
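For theme 2, the tree representation could be sketched as follows. This is only a minimal illustration, assuming that the URL path of each page mirrors the site hierarchy (category, subcategory, product), which is not true for every website. The URLs and the function names `build_site_tree` and `render` are hypothetical.

```python
from urllib.parse import urlparse


def build_site_tree(urls):
    """Nest URL path segments into a dict-of-dicts tree.

    Assumption: the path reflects the site hierarchy, e.g.
    /category/subcategory/product.
    """
    tree = {}
    for url in urls:
        node = tree
        # "/shoes/running/item-42" -> ["shoes", "running", "item-42"]
        segments = [s for s in urlparse(url).path.split("/") if s]
        for segment in segments:
            # Reuse the child node if it exists, otherwise create it.
            node = node.setdefault(segment, {})
    return tree


def render(tree, depth=0):
    """Return the tree as indented lines, children sorted by name."""
    lines = []
    for name in sorted(tree):
        lines.append("  " * depth + name)
        lines.extend(render(tree[name], depth + 1))
    return lines


# Hypothetical URLs of an online store.
urls = [
    "https://shop.example.com/shoes/running/item-42",
    "https://shop.example.com/shoes/hiking/item-7",
    "https://shop.example.com/bags/backpacks/item-13",
]

site_tree = build_site_tree(urls)
print("\n".join(render(site_tree)))
```

A real thesis would probably need more than URL paths (for example link structure or page templates), since many sites use flat or opaque URLs.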

Please let us know if you want to develop one of the above themes, and we will help you with the results of our research.

 

An Internet filter engine

I have always thought that companies have needs that are different from those of end users (see the classification by target, B2C or B2B), and I think this hypothesis also holds for the internet. Lately I have been busy developing the TheWebMiner Filter, and in the following lines I want to talk about internet search.

What is internet searching?

What I (and maybe many of you) understand by search is sorting. Google, Bing and other search engines try hard to find the most relevant page for our query, and the results are impressive. A colleague of mine told me that if you describe a movie's plot in a Google search, Google will find the movie's Wikipedia page. But this is the end user's point of view.
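To make the "search is sorting" idea concrete, here is a toy sketch. It assumes a very naive relevance score (how often the query words occur in a page); real search engines use far more sophisticated ranking. The pages and the function names `relevance` and `search` are made up for illustration.

```python
def relevance(query, text):
    """Toy score: total occurrences of the query words in the text."""
    words = text.lower().split()
    return sum(words.count(term) for term in query.lower().split())


def search(query, pages):
    """Return page titles sorted from most to least relevant."""
    return sorted(
        pages,
        key=lambda title: relevance(query, pages[title]),
        reverse=True,
    )


# Hypothetical mini "web" of three pages: title -> text.
pages = {
    "Cooking pasta": "boil water add pasta and salt",
    "Space movie": "a movie about an astronaut stranded on mars",
    "Mars facts": "mars is the fourth planet from the sun",
}

ranked = search("astronaut stranded on mars movie", pages)
print(ranked)
```

Every page is still in the result; the search only decides the order, which is what an end user sees as the first hit and the rest.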
