Performing Analysis Of Meteorological Data

In this post, I am going to analyze the data from the Weather data-set of Finland, a country in Northern Europe. The dataset has hourly temperature recorded for the last 10 years starting from 2006–04–01 00:00:00.000 +0200 to 2016–09–09 23:00:00.000 +0200. You can find the data-set on Kaggle. I am going to use the pandas and the matplotlib libraries of Python.

A null hypothesis to be considered is : “Ho : Has the Apparent temperature and humidity compared monthly across 10 years of the data indicate an increase due to Global warming”

The Ho means we need to find whether the average Apparent temperature for the month of a month say April starting from 2006 to 2016 and the average humidity for the same period have increased or not.

So let’s move towards to implementation part.

I have used Jupyter Notebook for implementing python code.

Firstly, we will import all necessary libraries.

Now, we will load our dataset using pandas’s read_csv

Now we will check the size of our dataset

Now, we will clean our data. For cleaning we will remove or drop unwanted columns.

Here, we want only two columns for our analysis. so except that two columns which is shown is figure drop all the unwanted columns.

Now we need to convert the type of “Formatted Date” field from object to proper DateTime format.

Now, Before resampling our data from day to month we have to set ‘Formatted Date’ as Index

After setting ‘Formatted Date’ as an index we will resample our data from Day to Month and calculate the mean.

Now we plot the graph to visualize the variation of temperature and humidity throughout the years.

Variation of Apparent Temp. Vs Humidity in 10 years

Here, we can see that in this 10 years of Dataset, Apparent temperature and humidity are not related. For all the year monthly average humidity is the same but the Apparent temperature is different. Global warming is affecting the earth’s temperature so that we see some uncertainty in this data.

Now we plot the graph to visualize the variation of temperature and humidity throughout the years for a specific month.

January

So we can observe that the temperature of January is varying throughout those years but the average humidity is constant. The lowest temperature was recorded was below -4°C and the highest temperature was above 2°C. We can clearly see that there is a sharp rise in temperature in the year 2007 whereas there is a fall in temperature in the year 2010. Hence we can conclude that global warming has caused uncertainty in temperature over the past 10 years while the average humidity has remained constant throughout the 10 years.

February

So we can observe that the temperature of February is varying throughout those years but the average humidity is constant. The lowest temperature was recorded was below -8°C in 2012 and the highest temperature was above 2–3°C. Hence we can conclude that global warming has caused uncertainty in temperature over the past 10 years while the average humidity has remained constant throughout the 10 years.

March

So we can observe that the temperature of March is varying throughout those years but the average humidity is constant. The minimum temperature was recorded was below 2°C and the maximum temperature was above 8°C. Hence we can conclude that global warming has caused uncertainty in temperature over the past 10 years while the average humidity has remained constant throughout the 10 years.

April

So we can observe that the temperature of April is varying throughout those years but the average humidity is constant so there is no relationship between Temperature and humidity. The minimum temperature was recorded was below 11°C and the maximum temperature was above 14°C. Hence we can conclude that global warming has caused uncertainty in temperature over the past 10 years while the average humidity has remained constant throughout the 10 years.

May

So we can observe that the temperature of May is varying throughout those years but the average humidity is constant. The lowest temperature was recorded was below 16°C and the highest temperature was above 17.5°C. Hence we can conclude that global warming has caused uncertainty in temperature over the past 10 years while the average humidity has remained constant throughout the 10 years.

June

So we can observe that the temperature of June is varying throughout those years but the average humidity is constant. The lowest temperature was recorded was 20°C and the highest temperature was above 20°C. Hence we can conclude that global warming has caused uncertainty in temperature over the past 10 years while the average humidity has remained constant throughout the 10 years.

July

So we can observe that the temperature of July is varying throughout those years but the average humidity is constant. The lowest temperature was recorded was 22°C and the highest temperature was above 24°C. Hence we can conclude that global warming has caused uncertainty in temperature over the past 10 years while the average humidity has remained constant throughout the 10 years.

August

So we can observe that the temperature of August is varying throughout those years but the average humidity is constant. The lowest temperature was recorded was 19°C and the highest temperature was above 24°C. Hence we can conclude that global warming has caused uncertainty in temperature over the past 10 years while the average humidity has remained constant throughout the 10 years.

September

Here, we can observe that the temperature of September is varying throughout those years but the average humidity is constant. The lowest temperature was recorded was 14°C and the highest temperature was above 19°C. Hence we can conclude that global warming has caused uncertainty in temperature over the past 10 years while the average humidity has remained constant throughout the 10 years.

October

Here, we can observe that the temperature of October is varying throughout those years but the average humidity is constant. The lowest temperature was recorded was 8°C in 2007 and the highest temperature was above 12°C. Hence we can conclude that global warming has caused uncertainty in temperature over the past 10 years while the average humidity has remained constant throughout the 10 years.

November

Here, we can observe that the temperature of November is varying throughout those years but the average humidity is constant. The lowest temperature was recorded was 1°C in 2007 and the highest temperature was above 7°C in 2010. Hence we can conclude that global warming has caused uncertainty in temperature over the past 10 years while the average humidity has remained constant throughout the 10 years.

December

Here, we can observe that the temperature of December is varying throughout those years but the average humidity is constant. The lowest temperature was recorded was -4°C in 2013 and the highest temperature was above 0°C . Hence we can conclude that global warming has caused uncertainty in temperature over the past 10 years while the average humidity has remained constant throughout the 10 years.

For Code and Dataset refer to my Github repository

I am thankful to mentors at https://internship.suvenconsultants.com for providing awesome problem statements and giving many of us a Coding Internship Experience. Thank you www.suvenconsultants.com.

A Information Technology Student with a focus in Data Science, Machine Learning, Android Development