As you noticed from the previous chapter, there are trips with negative and extremely huge durations (like more than 50 days). Surely this data can not be real, and we need to fix it if we want to go further.

What is the reason for extremely long trips? Most likely, it happened because some drivers forgot to turn off the taximeter when done with the route. The easiest way to deal with it - is simply to remove them as  outliers. We will remove all the observations with durations greater-equal than 2 days (1-day duration will be investigated).

But what can be the real reason for negative durations? Let's try to find it out. Do not forget about `timedelta` objects, since we want to compare durations (measured in hours, minutes, and seconds; rarely, in days).

> To not convert both columns to `datetime` every time, we can set `parse_dates` argument within `.read_csv` function to list with column names we want to convert.

Dates and(or) times are one of the most common things in modern data: often we need to deal with observations containing dates and times. In this course, you will learn about datetime, pytz libraries, and how they help to deal with different formats of dates and times. Also, you will learn how to implement this knowledge to work with dirty or missing date/time values.

In this section, you will learn how to deal with dates in your data. Specifically, you will learn how to create date objects, how to perform arithmetic operations, like difference, how to convert dates into different local formats, and so on.

In this section, you will learn how to deal with data containing not only dates, but also times, or times only.

In this section, you will learn how to apply and adapt timezones to datetime objects and also how to deal with daylight (or summer) time.

In this section, you will learn how to implement acquired knowledge about manipulation with dates and times for different datasets using pandas library. In particular, you will handle with taxi rides dataset in Mexico City, analyzing it, correcting mistakes, and so on.

Challenge: Investigation

Solução

Challenge: Investigation

Solução