West Nile Virus Prediction
Data Set Information:
West Nile virus is most commonly spread to humans through infected mosquitos. Around 20% of people
who become infected with the virus develop symptoms ranging from a persistent fever, to serious
neurological illnesses that can result in death. In 2002, the first human cases of West Nile virus were
reported in Chicago. By 2004 the City of Chicago and the Chicago Department of Public Health (CDPH)
had established a comprehensive surveillance and control program that is still in effect today. Every week
from late spring through the fall, mosquitos in traps across the city are tested for the virus. The results of
these tests influence when and where the city will spray airborne pesticides to control adult mosquito
populations. Given weather, location, testing, and spraying data, this competition asks you to predict
when and where different species of mosquitos will test positive for West Nile virus. A more accurate
method of predicting outbreaks of West Nile virus in mosquitos will help the City of Chicago and CP HD
more efficiently and effectively allocate resources towards preventing transmission of this potentially
deadly virus.
Attribute Information:
o Date: date that the WNV test is performed
o Address: approximate address of the location of trap. This is used to send to the GeoCoder.
o Species: the species of mosquitos
o Block: block number of address
o Street: street name
o Trap: Id of the trap
o AddressNumberAndStreet: approximate address returned from GeoCoder
o Latitude, Longitude: Latitude and Longitude returned from GeoCoder
o AddressAccuracy: accuracy returned from GeoCoder
o NumMosquitos: number of mosquitoes caught in this trap
o WnvPresent: whether West Nile Virus was present in these mosquitos. 1 means WNV is present, and
0 means not present
TASK:
There are two datasets train.csv and test.csv
Use information in train.csv to predict whether West Nile Virus was present in the mosquitos in the test
data
Source: