For our following analysis, we choose three cities San Francisco, Albuquerque and Detroit in following analysis demo.

In our exploration, we want to find the answers for questions that raised by business owners when they want to start or expand their business.

To start up, the business owner need to choose the city that they want to open their business. For this purpose, we are directed to the following question: which city should I consider?

In order to make better decision, an overview of city can help them get some insight, like the culture trend, the consumption level, the gathering places of restaurants. Here we use interactive bar plot to dig that hidden information.

Summary

By clicking the two buttons, we can compare different cities in various metrics.

The city is a large area with various neighborhood districts. So business owners is curious about the business condition of each district, which can help them pin down the location. The interavtive boxplot is utilized to explore the information.

Boxplot for Rating/Price in Different Areas

The orange bar in the boxplot shows the range of mean to 75% quantile while the green bar displays the range of 25% quantile to the mean.

After the city has been decided, the business owner may want to go deeper and narrow down the restaurants to certain category. The following questions are interesting to them.

What's the reasonable price for your business in the city you have chosen? And what rating do you expect to have for your business in the future?

We can know the distribution of price for each category in a city from the interactive bar plot of price. Also the histogram of rating tells us the distribution of rating for each kind of food.

Barplot for price
Histogram for rating

The relationship between price and rating is observed when we check the distribution of rating under different prices. While the price goes up, the rating also goes up. contingency table analysis is used to test this hypothesis.

H0: price and rating are independent VS H1: price and rating are dependent


Contingency Tables for Rating and Price
Rating Level
Price Level 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0
$ 6 23 61 177 375 631 816 470 131
$$ 5 7 39 111 325 821 1027 408 132
$$$ 2 3 4 14 34 107 135 43 19
$$$$ 3 3 2 3 3 9 30 27 8

P-value is < 2.2e-16, so p-value is smaller than alpha and we reject the null hypothesis. We are 95% confident that price and rating are related with each other. Therefore, business owners should not lower the price to attract customers without considering other factors like rating.

Other Questions:

Which attributes play a major role when it comes to attracting more customers, like Wifi, Parking, ambience etc.?