Author Archives: Adrian Balcan

How many gas stations are in United States?

Gas Station

In case you are curios about the number of gas stations in United States, we have a study for this. There are about 133,000 gas stations in US. This means 1,900 cars per gas station.

For more information check out our new tool TheWebMiner GEO.

Image source: http://www.prevention.com/sites/prevention.com/files/styles/article_main_image_2200px/public/images/articles/featured_images/Getty-154226161-Gas_station_night-Hans_Engbers_0.jpg?itok=tRCUbQMd

How many hotels are in United States?

Hotel

Recently we made a study about number of hotels in US. We have discovered that are 37,000 hotels in United States. This means that we have 8,300 inhabitants per hotel, but obviously the hotels are for tourists. Our math shows that 2,000 tourists go to sit in a hotel on year.

For more geo data you can use our tool TheWebMiner.com/geo

Image source: http://i.telegraph.co.uk/multimedia/archive/02323/HOTEL_2323685b.jpg

How to mount an existing EBS to an amazon instance

By existing, I want to say a non empty EBS, a formatted device :)

It’s very simple:

1. Use lsblk  command to view all attached devices:

[ec2-user ~]$ lsblk
NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT
xvdf 202:80 0 22G 0 disk
xvda1 202:1 0 8G 0 disk /

2. create dir and mount device:

[ec2-user ~]$ sudo mkdir /mnt/my-data
[ec2-user ~]$ sudo mount /dev/xvdf /mnt/my-data

Degree thesis subjects

I saw recently an event at an university in Romania (Universitatea Politehnica Bucuresti) that aims to help students to choose the subject for their degree thesis. At this event companies are invited to  present themes in front of students. You will find below a short list of themes related to our industry:

1. Automatic website classification

Possible categories: e-commerce, company website, news/blog, other.

2. Detecting website structure (and representing as a tree)

E.g. The first level of an online store contains main categories, second level sub categories and n level product page. The entire website can be represented as a tree.

3. Logo detection on internet

When detecting logos on a website page there are multiple issues that might occur. For example: many logos in same image, scaled logos.

Please let us know if you want to develop one of the above themes, and we will help you with results of our research.

 

An Internet filter engine

filterI always thought that companies have needs that are different from those of end users (see classification by target, B2C or B2B). And I think that this hypotheses is also true in internet area. These days I was busy with developing a TheWebMiner Filter and I want to talk in the following lines about internet search.

What is internet searching?

What I understand (and maybe many of you) by search is sorting. Google, Bing and other search engines try hard to find most relative page for our query and results are impressive. A colleague of mine told me that if you describe a movie scenario in a Google search, Google will find the Wikipedia page of movie. But this is an end user point of view.

Continue reading