Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: stop_times.txt and locations.geojson foreign key validation #1951

Merged
merged 4 commits into from
Jan 28, 2025

Conversation

cka-y
Copy link
Contributor

@cka-y cka-y commented Jan 27, 2025

Summary:
This PR introduces a new validator to ensure that location_id field in stop_times.txt reference valid entries in the locations.geojson file.

Expected behavior:
image

Please make sure these boxes are checked before submitting your pull request - thanks!

  • Run the unit tests with gradle test to make sure you didn't break anything
  • Add or update any needed documentation to the repo
  • Format the title like "feat: [new feature short description]". Title must follow the Conventional Commit Specification(https://www.conventionalcommits.org/en/v1.0.0/).
  • Linked all relevant issues
  • Include screenshot(s) showing how this pull request works and fixes the issue(s)

Copy link
Contributor

📝 Acceptance Test Report

📋 Summary

✅ The rule acceptance has passed for commit 8744a51
Download the full acceptance test report here (report will disappear after 90 days).

📊 Notices Comparison

New Errors (0 out of 1801 datasets, ~0%) ✅

No changes were detected due to the code change.

Dropped Errors (0 out of 1801 datasets, ~0%) ✅

No changes were detected due to the code change.

New Warnings (0 out of 1801 datasets, ~0%) ✅

No changes were detected due to the code change.

Dropped Warnings (0 out of 1801 datasets, ~0%) ✅

No changes were detected due to the code change.

🛡️ Corruption Check

0 out of 1801 sources (~0 %) are corrupted.

⏱️ Performance Assessment

📈 Validation Time

Assess the performance in terms of seconds taken for the validation process.

Time Metric Dataset ID Reference (s) Latest (s) Difference (s)
Average -- 3.63 3.72 ⬆️+0.09
Median -- 1.31 1.38 ⬆️+0.07
Standard Deviation -- 10.66 10.63 ⬇️-0.03
Minimum in References Reports ph-unknown-hm-transport-inc-and-robinsons-malls-gtfs-1105 0.46 0.57 ⬆️+0.11
Maximum in Reference Reports gb-unknown-uk-aggregate-feed-gtfs-2014 295.92 288.04 ⬇️-7.88
Minimum in Latest Reports ar-buenos-aires-subterraneos-de-buenos-aires-subte-gtfs-6 0.50 0.48 ⬇️-0.02
Maximum in Latest Reports gb-unknown-uk-aggregate-feed-gtfs-2014 295.92 288.04 ⬇️-7.88
📜 Memory Consumption
Metric Dataset ID Reference (s) Latest (s) Difference (s)
Average -- 455.20 MiB 467.75 MiB ⬆️+12.55 MiB
Median -- 331.92 MiB 331.92 MiB ⬇️0 bytes
Standard Deviation -- 712.06 MiB 788.44 MiB ⬆️+76.39 MiB
Minimum in References Reports us-california-redding-area-bus-authority-raba-gtfs-114 38.79 MiB 411.92 MiB ⬆️+373.13 MiB
Maximum in Reference Reports gb-unknown-uk-aggregate-feed-gtfs-2014 10.87 GiB 10.70 GiB ⬇️-180.94 MiB
Minimum in Latest Reports ro-vrancea-consiliul-judetean-vrancea-gtfs-1984 415.92 MiB 38.19 MiB ⬇️-377.73 MiB
Maximum in Latest Reports gb-unknown-uk-aggregate-feed-gtfs-2014 10.87 GiB 10.70 GiB ⬇️-180.94 MiB

@cka-y cka-y requested a review from davidgamez January 28, 2025 16:22
Copy link
Member

@davidgamez davidgamez left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@cka-y cka-y merged commit bfc5ef0 into master Jan 28, 2025
137 checks passed
@cka-y cka-y deleted the feat/1757 branch January 28, 2025 17:22
Copy link
Contributor

📝 Acceptance Test Report

📋 Summary

✅ The rule acceptance has passed for commit 2eccf00
Download the full acceptance test report here (report will disappear after 90 days).

📊 Notices Comparison

New Errors (0 out of 1801 datasets, ~0%) ✅

No changes were detected due to the code change.

Dropped Errors (0 out of 1801 datasets, ~0%) ✅

No changes were detected due to the code change.

New Warnings (0 out of 1801 datasets, ~0%) ✅

No changes were detected due to the code change.

Dropped Warnings (0 out of 1801 datasets, ~0%) ✅

No changes were detected due to the code change.

🛡️ Corruption Check

0 out of 1801 sources (~0 %) are corrupted.

⏱️ Performance Assessment

📈 Validation Time

Assess the performance in terms of seconds taken for the validation process.

Time Metric Dataset ID Reference (s) Latest (s) Difference (s)
Average -- 3.60 3.66 ⬆️+0.05
Median -- 1.31 1.35 ⬆️+0.04
Standard Deviation -- 10.45 10.51 ⬆️+0.06
Minimum in References Reports us-oregon-high-desert-point-gtfs-636 0.46 0.73 ⬆️+0.27
Maximum in Reference Reports gb-unknown-uk-aggregate-feed-gtfs-2014 282.13 283.61 ⬆️+1.49
Minimum in Latest Reports us-california-city-of-wasco-gtfs-1788 0.47 0.48 ⬆️+0.00
Maximum in Latest Reports gb-unknown-uk-aggregate-feed-gtfs-2014 282.13 283.61 ⬆️+1.49
📜 Memory Consumption
Metric Dataset ID Reference (s) Latest (s) Difference (s)
Average -- 462.35 MiB 465.42 MiB ⬆️+3.07 MiB
Median -- 332.54 MiB 333.92 MiB ⬆️+1.39 MiB
Standard Deviation -- 743.54 MiB 762.29 MiB ⬆️+18.75 MiB
Minimum in References Reports ro-vrancea-consiliul-judetean-vrancea-gtfs-1984 38.20 MiB 69.37 MiB ⬆️+31.17 MiB
Maximum in Reference Reports gb-unknown-uk-aggregate-feed-gtfs-2014 10.88 GiB 10.75 GiB ⬇️-139.64 MiB
Minimum in Latest Reports us-colorado-san-miguel-county-gtfs-2195 415.92 MiB 38.98 MiB ⬇️-376.94 MiB
Maximum in Latest Reports gb-unknown-uk-aggregate-feed-gtfs-2014 10.88 GiB 10.75 GiB ⬇️-139.64 MiB

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Flex: add geojson version of foreign_key_violation to support locations.geojson
2 participants