Data Analysis with Python

How to analyze Data with Python?

If you are a software developer then it can be felt some tricky how to use Python for Data Analysis. Unless you had an experiences on Data Analysis with Python yourself, it can be an unknown world.

- Data Analysis
Recently we have been told many times about terms of 'Big Data' and 'Growth Hacking'. IOT(Internet Of Thing) term indeed came from 'Ubiquitous' world. As same as this way Big Data and Growth Hacking are also actually came from 'Data Warehouse' to make it familiar with to people. So now how these terms can be applied to which area and how to be.

There are many processes to make huge data valuable. Data collection, Extraction, Clustering, Analyzing and Reporting...we call the person as Data Scientist doing those processes. Data Scientist should have the domain knowledge and the technology to manipulate those different types of data and know how to use it. The collected or extracted data can be called Big Data. Based on this Big Data, Growth Hacking can be referred to Analyzing and applied to Marketing purposes.

-Manipulate Data with Programming Language
(Reference: http://blog.revolutionanalytics.com/2014/12/oreilly-data-scientist-salary-and-tools-survey-november-2014.html, Tool usage rate for Data Analysis and Salary)

This graph describes the salary base and the tool usage rate for Data Analysis which surveyed Data Scientists staying around 41 states of USA and 53 countries. 

You may be more interested in the salary first but let it move on to the tools anyway in this time. Here brings Programming languages, Database, Hadoop, Visualization tool, BI program, Operating System, Statistics packages etc.

As you see SQL, Excel, R, Python are ranked at the top area. As DBMS, MySQL, MS SQL Server, Oracle, MongoDB, PostgreSQL ordered from the top. 

- Approaching to Data with Programming Language
Python programming language does not need the intermediate object because Python is able to access data and operates directly data in DBMS. Only one tricky point is this is a programming language so that general users are feeling trouble to start Python themselves this makes they prefer to use 'R' instead.

However there is nothing more easier than Python start. You can download here(https://www.python.org/downloads/).


After you execute the installation file then will see the Python interpreter screen as below.

Then just enter :
print "Data Munging, Wrangling, Processing" 
(enter)

That's it!
You will see the "Data Munging, Wrangling, Processing"

This is a start of Python programming. You can find more and more resources around the internet for Data Analysis and so on.




Comments

Popular posts from this blog

Primal problem and Dual problem

What is the Shadow price?

Why a negative coefficient of variable means it is not optimal in simplex method?