Feature Request: Provide option to reject a script if using unapproved packages #168

parmsam-pfizer · 2023-03-03T14:25:54Z

Feature Idea

Allow the user to select a "ignore", "warn", "error" with a "warn" default when running aexecute() on unapproved packages. The aexecute function could show the selected diagnostic message for unapproved package use to the user. This would be a good feature enhancement to the vignette on logging unapproved packages. I think this could be added in the part of the log_write function that checks for log.rx.approved:

logrx/R/log.R

Lines 233 to 243 in ef8428e

    
           if (file.exists(getOption("log.rx.approved"))) { 
        
              approved_functions <- readRDS(getOption("log.rx.approved")) 
        
              unapproved_functions <- get_unapproved_use(used_functions, approved_functions) 
        
              set_log_element("unapproved_packages_functions", unapproved_functions) 
        
              cleaned_log_vec <- c(cleaned_log_vec, 
        
                                   write_log_header("Unapproved Package and Functions"), 
        
                                   write_unapproved_functions()) 
        
              cleaned_log <- cleaned_log[!(names(cleaned_log)) %in% "unapproved_packages_functions"] 
        
           }

Curious about others thoughts on this.

Relevant Input

No response

Relevant Output

No response

Reproducible Example/Pseudo Code

No response

The text was updated successfully, but these errors were encountered:

nicholas-masel · 2023-08-29T16:14:02Z

I like the feature, but I'm wondering what the expected outcome is here?

Function and package use is found after the script has executed. Is the expectation is the script would error immediately when an unapproved function is used?

parmsam-pfizer · 2023-08-29T16:23:27Z

Yes, that's a great point you raise, @nicholas-masel. I think currently we can only support the feedback after script execution. That's what I was leaning towards. Not sure if we would have the ability to find the functions/packages used prior to executing the script.

thomas-neitmann · 2023-09-22T13:27:41Z

How about parsing the script prior to executing and then looking for all instances of library(), require() or pkg::fun() to discover any packages used? That way the script would not even get executed.

thomas-neitmann · 2023-09-22T14:41:13Z

So I found this to be an intriguing problem and went down the rabbit hole. Here's what I've come up with:

library(testthat)

extract_used_pkgs <- function(script) {
  ast <- parse(script)
  
  .extract_used_pkgs <- function(expr) {
    if (!is.call(expr)) {
      return(character())
    }
    
    if (expr[[1L]] == quote(library) || expr[[1L]] == quote(require) || expr[[1L]] == quote(`::`)) {
      if (isFALSE(expr$character.only) || is.null(expr$character.only)) {
        return(as.character(expr[[2L]]))
      }
      stop("Do not set `character.only` to `TRUE` inside `library()` or `require()`", call. = FALSE)
    }
    
    unlist(lapply(expr, .extract_used_pkgs))
  }
  
  pkgs <- lapply(ast, .extract_used_pkgs)
  unique(unlist(pkgs))
}

test_that("packages attached with `library()` are detected", {
  script <- c(
    "library(dplyr)",
    "library(ggplot2)",
    "library(stringr)",
    "x <- 0",
    "y <- x + 1"
  )
  file <- tempfile()
  writeLines(script, file)
  
  expect_identical(extract_used_pkgs(file), c("dplyr", "ggplot2", "stringr"))
})

test_that("packages attached with `require()` are detected", {
  script <- c(
    "require(dplyr)",
    "require(ggplot2)",
    "require(stringr)",
    "x <- 0",
    "y <- x + 1"
  )
  file <- tempfile()
  writeLines(script, file)
  
  expect_identical(extract_used_pkgs(file), c("dplyr", "ggplot2", "stringr"))
})

test_that("packages loaded with `::` are detected", {
  script <- c(
    "adsl <- haven::read_sas('./data/adsl.sas7bdat')",
    "dplyr::select(adsl, USUBJID, AGE, SEX)"
  )
  file <- tempfile()
  writeLines(script, file)
  
  expect_identical(extract_used_pkgs(file), c("haven", "dplyr"))
})

test_that("packages loaded with `::` are detected", {
  script <- c(
    "library('magrittr')",
    "adsl <- haven::read_sas('./data/adsl.sas7bdat')",
    "adsl %>% dplyr::select(USUBJID, AGE, SEX) %>% tidyr::filter(SEX == 'M')",
    "pkg1::select(pkg2::filter(adsl, SEX == 'M'), USUBJID, AGE, SEX)"
  )
  file <- tempfile()
  writeLines(script, file)
  
  expect_identical(extract_used_pkgs(file), c("magrittr", "haven", "dplyr", "tidyr", "pkg1", "pkg2"))
})

test_that("error when `character.only = TRUE`", {
  script <- c(
    "pkg1 <- 'dplyr'",
    "pkg2 <- 'ggplot2'",
    "library(pkg1, character.only = TRUE)",
    "require(pkg2, character.only = TRUE)"
  )
  file <- tempfile()
  writeLines(script, file)
  
  expect_error(extract_used_pkgs(file))
})

parmsam-pfizer · 2023-09-22T14:44:17Z

Great suggestion! That's exactly what we discussed at our last logrx team meeting. Thanks for sharing your code! We'll add this to our roadmap.

nicholas-masel · 2025-03-06T17:26:06Z

@thomas-neitmann @parmsam-pfizer We finally got around to this feature and it falls over with base and default package usage. More details here: #222 (comment)

parmsam-pfizer added the enhancement New feature or request label Mar 3, 2023

bms63 added this to logrx 0.3.0 Jun 12, 2023

bms63 added the release 0.3.0 label Jun 12, 2023

bms63 moved this to 📋 Backlog in logrx 0.3.0 Jun 12, 2023

parmsam-pfizer mentioned this issue Aug 28, 2023

Feature Request: Option to error if unapproved package is used #204

Open

kodesiba removed the release 0.3.0 label Jan 30, 2025

nicholas-masel mentioned this issue Feb 20, 2025

Ability to only log package use and unapproved packages, instead of functions #222

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature Request: Provide option to reject a script if using unapproved packages #168

Feature Request: Provide option to reject a script if using unapproved packages #168

parmsam-pfizer commented Mar 3, 2023

nicholas-masel commented Aug 29, 2023

parmsam-pfizer commented Aug 29, 2023

thomas-neitmann commented Sep 22, 2023

thomas-neitmann commented Sep 22, 2023

parmsam-pfizer commented Sep 22, 2023

nicholas-masel commented Mar 6, 2025 •

edited

Loading

Feature Request: Provide option to reject a script if using unapproved packages #168

Feature Request: Provide option to reject a script if using unapproved packages #168

Comments

parmsam-pfizer commented Mar 3, 2023

Feature Idea

Relevant Input

Relevant Output

Reproducible Example/Pseudo Code

nicholas-masel commented Aug 29, 2023

parmsam-pfizer commented Aug 29, 2023

thomas-neitmann commented Sep 22, 2023

thomas-neitmann commented Sep 22, 2023

parmsam-pfizer commented Sep 22, 2023

nicholas-masel commented Mar 6, 2025 • edited Loading

nicholas-masel commented Mar 6, 2025 •

edited

Loading