5 Replies - 453 Views - Last Post: 09 September 2018 - 07:18 AM

#1 iiWylde9

  • New D.I.C Head

Reputation: 0
  • Posts: 3
  • Joined: 04-September 18

Need some advice as to how I should normalize my data.

Posted 04 September 2018 - 03:09 PM

So I am a beginner in Data Science and I found a nice data set involving bees and pesticide usage that I am experimenting with.

Now, I want to normalize the data because large states like California will by default have more bee colonies than small states like, say, Rhode Island. Would I be able to do something like use the min-max method to normalize the number of bee colonies and then divide by the land area of the state?

I already did some normalization with the pesticides by using a min-max method.

The columns are state, number of colonies, yield per colony, total production, stocks, price per pound, production value, State Name, Region, the specific pesticide compounds (nCLOTHIANIDIN, nIMIDACLOPRID, nTHIAMETHOXAM, nACETAMIPRID, nTHIACLOPRID), nAllNeonic (the previous pesticide compounds added up), and the land area of each state (e.g. California is 155,779 square miles).

For some reason I am drawing a blank as to how I can normalize the number of colonies by land area. Would the min-max method still be viable?
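
To make the idea concrete, here is a minimal Python sketch of one possible order of operations: divide the colony count by land area first to get colonies per square mile, then min-max scale that density. The function names and the sample numbers are assumptions for illustration only (the only figure taken from the post is California's 155,779 square miles); this is not necessarily how the data set should be handled.

```python
def min_max(values):
    """Linearly rescale values to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values identical: avoid dividing by zero.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def colonies_per_sq_mile(colonies, land_area):
    """Divide each colony count by the matching land area."""
    return [c / a for c, a in zip(colonies, land_area)]


# Made-up colony counts and areas, except the first area,
# which is the California figure quoted in the post.
colonies = [400_000, 1_000, 100_000]
land_area = [155_779, 1_000, 250_000]

density = colonies_per_sq_mile(colonies, land_area)
density_scaled = min_max(density)

for raw, scaled in zip(density, density_scaled):
    print(f"{raw:.4f} colonies per sq mile -> {scaled:.4f} scaled")
```

Dividing first keeps the result interpretable as a density. Scaling the raw counts first and then dividing by area would send the smallest state to 0 regardless of its size and would no longer keep the values in [0, 1].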


Replies To: Need some advice as to how I should normalize my data.

#2 DK3250

  • Pythonian

Reputation: 507
  • Posts: 1,604
  • Joined: 27-December 13

Re: Need some advice as to how I should normalize my data.

Posted 04 September 2018 - 11:19 PM

Hello, Welcome to Dream-In-Code.

Your question is not specific to Python; it is more of a general software question.
I guess you will get a much better response if the question is located in the 'Software Development' forum.
I have moved it here.

#3 astonecipher

  • Senior Systems Engineer

Reputation: 2769
  • Posts: 10,963
  • Joined: 03-December 12

Re: Need some advice as to how I should normalize my data.

Posted 05 September 2018 - 07:26 AM

https://www.studyton...rmalization.php
http://agiledata.org...malization.html




Not quite sure what you are asking, so I tossed in some links on what normalization is.

#4 ndc85430

  • I think you'll find it's "Dr"

Reputation: 975
  • Posts: 3,842
  • Joined: 13-June 14

Re: Need some advice as to how I should normalize my data.

Posted 08 September 2018 - 12:11 AM

No, they're asking about normalising in the numeric sense, rather than the DB sense.

The OP has posted another thread here.

This post has been edited by ndc85430: 08 September 2018 - 12:11 AM


#5 astonecipher

  • Senior Systems Engineer

Reputation: 2769
  • Posts: 10,963
  • Joined: 03-December 12

Re: Need some advice as to how I should normalize my data.

Posted 08 September 2018 - 03:03 PM

https://www.datascie...d-normalization


Then?

#6 Skydiver

  • Code herder
  • member icon

Reputation: 6773
  • Posts: 23,078
  • Joined: 05-May 12

Re: Need some advice as to how I should normalize my data.

Posted 09 September 2018 - 07:18 AM

But doesn't normalization of data presume that the data is normally distributed? Shouldn't he first perform some kind of statistical test to verify that it is normally distributed before just blindly normalizing it?
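
For reference, min-max scaling is just a linear rescaling and does not itself depend on the shape of the distribution. If a normality check is wanted anyway, one possible sketch is a hand-rolled Jarque-Bera test, which combines sample skewness and excess kurtosis. The function name and the sample values below are assumptions for illustration, and the chi-square approximation it relies on is known to be rough for small samples such as 50 states.

```python
import math


def jarque_bera(values):
    """Return (JB statistic, approximate p-value) for a sample."""
    n = len(values)
    mean = sum(values) / n
    # Central moments using the divide-by-n estimator.
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    skew = m3 / m2 ** 1.5
    excess_kurtosis = m4 / m2 ** 2 - 3.0
    jb = n / 6.0 * (skew ** 2 + excess_kurtosis ** 2 / 4.0)
    # Under normality JB is asymptotically chi-square with 2 degrees
    # of freedom, whose survival function is exp(-x / 2).
    p_value = math.exp(-jb / 2.0)
    return jb, p_value


# Made-up, heavily right-skewed sample for illustration.
sample = [1, 1, 1, 1, 1, 1, 1, 1, 1, 100]
jb, p = jarque_bera(sample)
print(f"JB = {jb:.3f}, p = {p:.6f}")
if p < 0.05:
    print("Normality is rejected at the 5% level.")
else:
    print("No evidence against normality at the 5% level.")
```

A small p-value suggests the sample is unlikely to come from a normal distribution; a large one only means the test found no evidence against normality.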
