Skip to content

Concise name updates for StatVars_Batch 1#1218

Open
mandelapro wants to merge 1 commit intodatacommonsorg:mainfrom
mandelapro:concise_name_batch_1_update
Open

Concise name updates for StatVars_Batch 1#1218
mandelapro wants to merge 1 commit intodatacommonsorg:mainfrom
mandelapro:concise_name_batch_1_update

Conversation

@mandelapro
Copy link

Batch 1 update for StatVar concise naming consistency.

  • Updated concise names
  • Improved naming alignment

@google-cla
Copy link

google-cla bot commented Feb 12, 2026

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

@gemini-code-assist
Copy link

Summary of Changes

Hello @mandelapro, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request focuses on enhancing the overall clarity and consistency of statistical variable definitions within the system. By systematically updating the 'name' and 'description' fields across multiple data definition files, the changes aim to provide more precise and user-friendly labels for various statistical indicators. This effort is part of a broader initiative to improve data discoverability and interpretation.

Highlights

  • Statistical Variable Naming: Updated the 'name' and 'description' properties for numerous statistical variables across various MCF files to ensure concise and consistent terminology.
  • Data Clarity: Improved the clarity and alignment of statistical variable definitions, making them easier to understand and use.
Changelog
  • stat_vars/air_ghg_emissions_stat_vars.mcf
    • Added 'name' and 'description' properties to statistical variable nodes for carbon dioxide, carbon monoxide, greenhouse gas, oxides of nitrogen, oxides of sulfur, and volatile organic compound emissions.
  • stat_vars/bea_us_gdp_quarterly_stat_vars.mcf
    • Added 'name' and 'description' properties to statistical variable nodes for US state quarterly industry GDP by NAICS codes.
  • stat_vars/brazil_stat_vars.mcf
    • Added 'name' and 'description' properties to numerous statistical variable nodes related to Brazil's population demographics (age, gender, race), literacy status, student enrollment levels, employment status, economic sectors, and consumer price index data.
  • stat_vars/brazil_visdata_stat_vars.mcf
    • Added 'name' and 'description' properties to statistical variable nodes concerning Brazil's food basket distribution and rural development programs, including various household types and aid distribution partners.
  • stat_vars/brfss_asthma.mcf
    • Added 'name' and 'description' properties to statistical variable nodes for asthma prevalence and standard error in children and adults.
  • stat_vars/calculated_stat_vars.mcf
    • Added 'name' and 'description' properties to statistical variable nodes for specific age-grouped population counts.
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature Command Description
Code Review /gemini review Performs a code review for the current pull request in its current state.
Pull Request Summary /gemini summary Provides a summary of the current pull request in its current state.
Comment @gemini-code-assist Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help /gemini help Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

Copy link

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request aims to improve naming consistency for statistical variables by adding name and description fields. While this is a valuable effort, the changes introduce a significant number of critical errors, including syntax issues that will break parsing, and logical inconsistencies where names and descriptions do not match the variable IDs. I have highlighted several instances of copy-paste errors, incorrect descriptions, and malformed node definitions. These issues must be addressed to ensure the integrity and usability of the data. I recommend a thorough review of all changes to correct these widespread problems.

Comment on lines +1595 to +1596
name: "Monthly mean income of employed persons aged 14 years and over in the accommodation and food sector"
description: "The monthly mean income of employed people aged 14 years and over in the accommodation and food sector from their main job"

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

critical

The name and description here are for the 'accommodation and food sector', but the dcid Monthly_Mean_Income_Person_Years14Onwards_AdminSocialServiceEducation_Employed_MainJob is for 'AdminSocialServiceEducation'. This is a critical copy-paste error that misrepresents the data.

name: "Monthly mean income of employed persons aged 14 years and over in the public administration, defense, social security, education, human health and social services sector"
description: "The monthly mean income of employed people aged 14 years and over in the public administration, defense, social security, education, human health and social services sector from their main job"

Comment on lines +3 to +4
name: "Food baskets delivered to households in emergency care"
description: "The number of food baskets delivered to households in emergency care"

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

critical

The name and description refer to 'number of food baskets', but the node's measuredProperty is amount and populationType is Food. The original name was 'Kilograms of food...'. This is an incorrect update, as it changes the meaning of the variable from a weight/amount to a count.

@@ -1,30 +1,33 @@

Node: dcid:StandardError_Person_Children_WithAsthma
name: "asthma Prevalence Standard Error in Children

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

critical

This line is missing a closing quote, which is a syntax error that will cause parsing to fail. Additionally, 'asthma' should be capitalized for consistency with other names and medical terminology.

name: "Asthma Prevalence Standard Error in Children"

literacyStatus: dcid:Illiterate

Node: dcid:Count_Person_10OrMoreYears_Female_Literate
name: "Illiterate Female Population Aged 10+ Years"

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

critical

The name 'Illiterate Female Population Aged 10+ Years' contradicts the dcid Count_Person_10OrMoreYears_Female_Literate. This appears to be a copy-paste error. The name should reflect that this variable is for literate females. Similar critical mismatches are present on lines 195 and 205.

name: "Literate Female Population Aged 10+ Years"

Comment on lines +542 to +543
name: "Public School Nursery Students"
description: "The number of students enrolled in nursery programs in public schools"

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

critical

The dcid Count_Student_PreKindergarten_PublicSchool indicates this node is for pre-kindergarten students, but the name and description refer to 'Nursery'. This is a critical mismatch that will lead to incorrect data interpretation. Please update the name and description to refer to pre-kindergarten. Similar mismatches exist on lines 552-553 and 562-563.

name: "Public School Pre-K Students"
description: "The number of students enrolled in pre-kindergarten programs in public schools"

Comment on lines +58 to +59
name: “Food Delivered to Extractive Families”
description: "The amount of food delivered to extractive families

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

critical

There are syntax errors here. The name uses curly quotes (“... ”) instead of straight quotes ("..."), and the description is missing a closing quote. These will cause parsing errors. Please use straight double quotes and ensure they are properly closed. This issue is present in multiple places in this file (e.g., lines 68-69, 168, 421).

name: "Food Delivered to Extractive Families"
description: "The amount of food delivered to extractive families"


Node: dcid:Count_Household_ReceivedFoodBasket_FishingFamily
name: "Fishing Families Receiving Food Baskets"
description: "The Number of food baskets that were destined for fishing families"

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

critical

The description 'The Number of food baskets that were destined for fishing families' does not match the node Count_Household_ReceivedFoodBasket_FishingFamily. The description is for destined baskets, but the node is for households that received baskets. This is a critical error that misrepresents the data. Similar description mismatches are present on lines 202, 257, 312, and 323.

description: "The number of fishing family households that received food baskets"


Node: dcid:Count_Person_BrazilIndigneous
name: "Brazil Indigenous Population"
description: "The number of people identified as Brazil Indigneous race"

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

high

There is a typo in 'Indigneous'; it should be 'Indigenous'. This typo appears in multiple places within this file (e.g., lines 123, 615, 626). Please correct it throughout for consistency and accuracy.

description: "The number of people identified as Brazil Indigenous race"


Node: dcid:Annual_Amount_Emissions_OxidesOfNitrogen_AsAFractionOf_Count_Person
name: "Annual nitrogen oxide emissions per person"
description: "The annual amount of oxides of nitrogen emissions measured relative to The number of people"

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

There is an inconsistent capitalization in the description. The word 'The' should be lowercase to maintain consistency with other descriptions in the file. A similar issue exists on line 120.

description: "The annual amount of oxides of nitrogen emissions measured relative to the number of people"


Node: dcid:Count_Person_16To19Years
name: "Persons Aged 16–19"
description: "The Number of people whose age is between 16 and 19 years"

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

The word 'Number' is capitalized at the beginning of the description. For consistency with standard sentence case in descriptions, it should be lowercase. This applies to other descriptions in this file as well (lines 13, 22, 31, 40, 49).

description: "The number of people whose age is between 16 and 19 years"

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant