Q1. What do you think being a data scientist is about?

Being a data scientist in my opinion is an amalgamation of having the theoretical knowledge of statistics and its concepts, while also having the ability to apply these theories to solve real world problems and extract insights from huge amounts of data with the help of programming skills. Being a data scientist requires the person to use statistics, programming, and domain/product expertise to solve problems specific to that industry/sector/area.

Q2. What do you see as the major duties and/or knowledge areas?

A data scientist leverages his/her skills to draw meaningful insights from data and allows decisions to be made using these insights. They are responsible to work with data in whichever form or nature, and make the data useful. In a sense, they make data powerful and useful. A data scienstist is expected to not only use the data to derive meaning from, but also clean, manage, and process that data so that the data can later be used for further analysis and modelling purposes. In today’s time, a data scienstist could come from a wide variety of background be it physics, economics, mathematics etc., but the thing that makes them data scientists is that they can use data to draw useful insights in there area of expertise be it finance, medicine, astronomy, etc.,

Another aspect of the duties that are part of being a data scientist is the ability to communicate the findings to the business and management, to allow them to make decisions based off the learnings founds from perfomring analysis on the data. This is a vital reponsibility as this is where the real world impact of perfomring analysis as a data scientist comes into play.

Q3. What differences/similarities do you see between data scientists and statisticians?

The field of data science and statistics has its differences and similarities as per my understanding, some of them are stated below:

Differences:

-The size of data consumed and processed

-Types of problems studied

-Modelling Process

-Statistics is more theoretical versus data science is more application based

Similarities:

-Knowledge of statistics

-Using statistics to extract insights from data

-Solving problems

Q4. How do you view yourself in relation to these two areas?

Personally, I see myself at the convergence of these 2 fields. Primarily because, having gained the theoretical knowledge of statistics has helped me understand on a fundamental level the tools and techniques that can be used to extract meaningful insights from data and having worked on projects which required dealing with large amounts of data in a systematic manner allowed me to deploy those skills to more scalable problems and get similar insights from them using programming and algortihms. Although there is still much to learn and apply in a real world setting which I look forward to doing at some point in my career.


<
Blog Archive
Archive of all previous blog posts
>
Next Post
Project 1 Learnings and Improvements