Methods
Brief notes about data sources, preprocessing, and caveats.
Data
- SA2 Shapefile Geoboundaries provided me with the data to geocode SA2’s.
- Schools in ACT Was my initial source for ACT goverment School opening and closing dates. This was important as to do a longitudinal analysis of walk times, I required information about schools that use to be open in 2001. In addition, I supplemented this dataset with wikipedia article, as the previous dataset hadn’t been updated in recent years. I geocoded each school by searching for it in google maps.
- 2001-2021 ERP Data with ASGS3 boundaries data set provides the Estimated Residential Population from 2001 to 2024. This is the backbone of my analysis,
- Open Street Maps (OSM) Walking Network The OSM walking network is utilized by the dodgr package to created a graph network that can be used to calculated walking distance and speed.
Method and Limitations
- Given computational constraints, walk times per suburb were computed by placing 10 random locations in each sa2, and the walk time to the closest primary or high school was calculated. Then the populations was evenly weighted to each of the random locations and a weighted walk time was estimated for the SA2,
- The “Optimal New School Location” was found by randomly sampling 100 locations in the ACT and then calculating the average weighted walk time to a primary and high school’s if this new schools was included.
Assumptions
Given data limitations, many assumptions had to be made. Some of these assumptions are contained below: - Assume that 5-9 yo are representative of primary school aged children and 10-14 for high school aged children, - Assume that the populations are evenly distributed in each SA2, - Assume that walkways are accessible to all students (which they will not be in reality), - All children go to school, doesn’t account for home schooling, - Assume that schools have infitine capacity. In reality, schools will have a fixed numer of students that they can teach. Future analyis could account for this, and it may result in resonably different findings particularily about where the optimal school placement is. - Some outer suburbs in Canberra were removed because of their small populations.
Further Analysis
- Use forecasting populations to plan new school locations and analyse future walk times,
- Take into account private schools and school capacities,
- Weight the random locaitons by residential areas and increase the size of the random sampling,
- Look at how best to add multiple schools or whether some schools could be moved to better service the community.