Introduction
Many Northeastern students lead busy commuting lives, navigating between co-ops, classes, housing, and throughout our lovely city of Boston. There is a schism that arises in that busy life though, something difficult for not only Northeastern students but for working adults: time management. Managing getting to and from classes, making it off campus to meet your friends out for dinner, or even just getting to the grocery store can be a hassle. From our experience, one of the most common ways students do this is by utilizing Bluebikes, Boston's self powered transit system that allows you to get around the city with ease. Going into this project, we wanted to learn how people at Northeastern are using Bluebikes throughout the year. So, we decided to research one month from each season during the school year to find out more.
Questions we wanted to answer included:
- How do Northeastern students trips fluctuate from season to season?
- What factors affect Bluebike trips the most?
- To what extent to people utilize the Northeastern Bluebike stops and how?
Reference papers:
Introducing the data
For our research, we decided to retrieve our data straight from the Bluebike website, which provides
comprehensive trip information for every month since 2016. To keep our analysis focused and up to
date, we used only the months of April, September, and December of 2024, representing three
different points in the school year. To support our mapping visualizations, we retrieved a shapefile
for Massachusetts from the official Massachusetts government website.
For our Bluebike data, each month contains approximately 600,000 rows of data. However, as we are
only wanting to study Northeastern University usage, we filtered the data by only keeping the stops
going to or from our campus. In doing this we were able to get one dataset that combined each month
down to only 17,000 rows.
Regarding the attributes of our data, we were given an extensive list of features for analysis. Each
file contains the start and end points (coordinates, ID, name), time taken, member type, bike type,
and the trip ID. These features provided a solid foundation that allowed us to further investigate
how the Northeastern community utilizes Bluebikes throughout the academic year.
Visualization 1
These two maps showcase all of the trips made to and from the ruggles station in the time window we examined. The first map shows the trips made originating at the ruggles station, while the second map shows the trips made that ended at the ruggles station. The points indicate stations, with the color of the points encoded according to the number of trips originating from/ending at that station, logarithmically scaled. The outgoing map shows that most activity is clustered around the ruggles station, indicating shorter trips overall to neighborhoods like Fenway, Back Bay, and Jamaica Plain. The incoming map shows rides being much more evenly distriubted across the city, with some closeby stations still having the the most activity. One possible explanantion of this trend is the fact that ruggles is a transportation hub, with connections to the orange line, commuter rail, and many bus routes. People taking bluebikes to ruggles may be using them as a part of a longer trip.
Visualization 2
From this visualization, we just wanted to get a general feel for how each attribute was different for each of the three months. Starting by analyzing the chart looking at member types, we can see that casual users by far had the largest amount of trips, with the highest overall amount of trips being in September. April followed with a moderate amount of usage, and December had a major drop, which we believe we can attribute to the influx of cold weather and winter break. From the visualization showing the bike types, we can see that people overwhelmingly favored the classic bikes over the electric bikes in each of the months without much of a change in the ratio. Finally, when looking at the average ride duration for each of the months we can see that the time was fairly consistent throughout each of them. There was a slight decline in December at an average of ten minutes as opposed to the twelve in April and September, which we again believe we can attribute to the colder weather. Overall, this data showed us that there was a strong difference in usage between the seasons and that there is a preference in bike and member types.
Visualization 3
This boxplot we created compares the Manhattan Distance between the start and end stops of each trip traveled by electric and classic Bluebikes in each of our research months. Looking at the data from April, we can see that the trips are generally shorter and fairly consistent, with electric bikes having a larger range with more variation than their classic counter parts. In the month of September, we see the largest general trip length as well as variability, with the electric bikes haviung the largest number of high outliers. Finally, looking at the trips in December, they are slightly shorter than the other months of interest and generally have more uniform distances between both of the bike types. Just as with the previous graphs we will attribute this decrease to the colder weather as compared to the other months. From this visualization, we were able to learn how overall, electric bikes tend to support longer and more variable ride distances when compared with classic bikes.
Visualization 4
This chart indicates the amount of rides starting in each hour of the day, over the course an entire month for the Ruggles station. The x-axis shows the hour of the day, while the y-axis shows the number of rides starting at that hour. This chart once again gives persepctive on the difference in quantity of rides between the months we are examining with drastic differneces between them. The main trend demonstrated by this chart is the peak times of day by quantity of rides. There are consistent peaks across all three months between the hours of 7 and 9 in the morning, and 4 and 6 in the evening. This indicates that the majority of rides are being taken during commuting hours, despite the fact that the station is located near a college campus where most people typically don't need to commute. The commuting trend still being present could be due to factors such as Northeastern's co-op program, or the nearby orange line and residential areas making the station a popular stop for commuters.
Visualization 5
This heatmap was chosen to uncover when during the week Bluebike usage is most concentrated, providing a temporal lens to complement our spatial and demographic analyses. By breaking rides down by hour and day, and enabling filtering by month, this visualization reveals distinct commuter behavior patterns and seasonal fluctuations. For instance, September shows pronounced morning and evening spikes on weekdays—consistent with school and work commutes—while April and December exhibit more diffuse or weather-dependent patterns. This interaction helps contextualize ride volume changes within daily life rhythms, offering insights into infrastructure usage and rider intent beyond aggregate trip counts.
Summary of findings
Through our five interactive visualizations, we’ve uncovered several key trends about Bluebike usage at and around Northeastern University:
- Volume Fluctuations by Month: Ride activity peaks in September and April, aligning with the start of academic semesters and milder weather. December sees a sharp decline, likely due to colder temperatures and holiday travel.
- Membership Trends: Members consistently outpace casual riders in volume, especially during commuting hours, suggesting that many users treat Bluebike as part of their daily routine rather than a recreational service.
- Station Dynamics: Ruggles T Stop stands out as the central hub for both ride departures and arrivals, reinforcing its importance as a multi-modal transit node connecting students to the broader city network.
- Trip Duration and User Behavior: Casual riders tend to take longer trips, especially on weekends, which aligns with leisure-oriented usage. Members take shorter, more frequent trips, especially during weekdays.
- Temporal Patterns of Use: The final heatmap visualization highlights distinct usage rhythms throughout the day. Weekdays show strong AM and PM peaks—particularly in September—supporting the idea of Bluebike as a commuting tool. In contrast, weekends and December rides occur more evenly throughout the day, likely reflecting recreational or off-peak usage.
Together, these findings point to a system that supports both structured commuting and flexible recreation, with demand shaped strongly by seasonality, academic calendars, and time of day. These insights can inform better bike availability planning, infrastructure investment, and targeted promotions to maximize ridership throughout the year.