Data Analytics Interview Questions – Set 05

Design a view in a map such that if a user selects any country, the states under that country has to show profit and sales.

According to your question, you must have a country, state, profit and sales fields in your dataset.

  • Double-click on the country field.
  • Drag the state and drop it into Marks card.
  • Drag the sales and drop it into size.
  • Drag profit and drop it into color.
  • Click on size legend and increase the size.
  • Right-click on the country field and select show quick filter.
  • Select any country now and check the view.

Mention how to deal the multi-source problems?

To deal the multi-source problems,

  • Restructuring of schemas to accomplish a schema integration
  • Identify similar records and merge them into single record containing all relevant attributes without redundancy

How should you tackle multi-source problems?

To tackle multi-source problems, you need to:

  • Identify similar data records and combine them into one record that will contain all the useful attributes, minus the redundancy.
  • Facilitate schema integration through schema restructuring.

What is the Interquartile Range?

Shown in a box plot, the interquartile range is the difference between the lower and upper quartile, and is a measure of the dispersion of data. If you’re interviewing for a data analyst job, it’s important to be prepared with a similar answer and to answer confidently.

Which data analyst software are you trained in?

This question tells the interviewer if you have the hard skills needed and can provide insight into what areas you might need training in. It’s also another way to ensure basic competency. In your answer, include the software the job ad emphasized, any experience with that software you have, and use familiar terminology.

Here’s a sample answer:

“I have a breadth of software experience. For example, at my current employer, I do a lot of ELKI data management and data mining algorithms. I can also create databases in Access and make tables in Excel.”

What is the monthly profit of your favorite restaurant?

Pick a small family restaurant and not a chain of restaurants. This should make calculations much easier.

Then define the main parameters of the restaurant that we are talking about:

  • Days of the week in which the restaurant is open
  • Number of tables/seats
  • Average number of visitors:
    – during lunchtime;

– at dinner;

  • Average expenditure:

– per client during lunch;

– per client during dinner.

The restaurant is open 6 days of the week (they are closed on Monday), which means that is open 25 times during lunch and dinner time per month. It is a small family restaurant with around 60 places. On average 30 customers visit the restaurant at lunch and 40 people come to have dinner. The typical lunch menu costs 10 euro, while dinner at this restaurant costs twice that amount – 20 euro. Therefore, they are able to achieve revenues of:

25 (days) * 30 (customers) * 10 (EUR) = 7,500 EUR (lunch)

25 (days) * 40 (customers) * 20 (EUR) = 20,000 EUR (dinner)

The restaurant is able to achieve 27,500 EUR of sales. Besides, the owner and his wife 4 people work there as well. Let’s say that the 3 waiters make 2,000 EUR each and the chef makes 3,000 EUR (including social security contributions). So the cost of personnel is 9,000 EUR. Usually, food and drinks cost around one-third of the overall amount of sales. Therefore the cost of goods sold amounts to 9,125 EUR. Utility and other expenses are another 10% of Sales, so we will have an additional cost of 2,750 EUR. The owners do not pay rent, because they own the place. After the calculations that we made, it results in a monthly profit of (before taxes) 6,625 EUR.

What is the difference between variance and covariance?

Variance and Covariance are two mathematical terms which are used frequently in statistics. Variance basically refers to how apart numbers are in relation to the mean. Covariance, on the other hand, refers to how two random variables will change together. This is basically used to calculate the correlation between variables.

In case you have attended any Data Analytics interview in the recent past, do paste those interview questions in the comments section and we’ll answer them ASAP. You can also comment below if you have any questions in your mind, which you might have faced in your Data Analytics interview.

What is an outlier?

Any observation that lies at an abnormal distance from other observations is known as an outlier. It indicates either a variability in the measurement or an experimental error.

What’s your knowledge of statistics and how have you used it in your work as a data analyst?

Data analysts should have basic statistics knowledge and experience. That means you should be comfortable with calculating mean, median and mode, as well as conducting significance testing. In addition, as a data analyst, you must be able to interpret the above in connection to the business. If a higher level of statistics is required, it will be listed in the job description.

Example
“In my line of work, I’ve used basic statistics – mostly calculated the mean and standard variances, as well as significance testing. The latter helped me determine the statistical significance of measurement differences between two populations for a project. I’ve also determined the relationship between 2 variables in a data set, working with correlation coefficients.”

Name some of the essential tools useful for Big Data analytics.

The important Big Data analytics tools are –

  • NodeXL
  • KNIME
  • Tableau
  • Solver
  • OpenRefine
  • Rattle GUI
  • Qlikview