284 MBox Refresher #295

Open
wants to merge 86 commits into
base: master
Changes from 83 commits
8c1021d
Created parse_mbox_latest_date and refresh_mbox functions and updated…
ian-lastname Apr 19, 2024
72238a7
Edited download_pipermail to save pipermail files as mbox files, crea…
ian-lastname Apr 24, 2024
99fb7e3
Changed function name from refresh_mbox to refresh_mod_mbox for consi…
ian-lastname Apr 25, 2024
618f2d0
Added checks in refresh functions and in download_mod_mbox_per_month …
ian-lastname Apr 25, 2024
0751218
fix github checks
carlosparadis Apr 28, 2024
be4ff32
Re-added error message in refresh_pipermail when an http error is enc…
ian-lastname Apr 29, 2024
b5be04e
Added comments to download_pipermail
ian-lastname Apr 30, 2024
d2ce222
Minor documentation update for setup verification.
daomcgill Sep 10, 2024
7c585ae
i #284 Refactor download_pipermail function
daomcgill Sep 15, 2024
69ca163
i #284 Updated documentation and modified function for download_piper…
daomcgill Sep 17, 2024
b9a886b
i #284 Edited download_pipermail() and Added refresh_pipermail() and …
daomcgill Sep 17, 2024
3c88140
i #284 Added more descriptive comments. Made minor changes to piperma…
daomcgill Sep 19, 2024
5de3aa2
i #284 Added more descriptive comments. Made minor changes to piperma…
daomcgill Sep 19, 2024
8a373d6
Merge branch '284-mbox-download-refresher' of https://github.com/sail…
daomcgill Sep 19, 2024
b91389b
i #284 Added download_mod_mbox function and edited notebook
daomcgill Sep 21, 2024
0cc4123
i #284 Added refresh_mod_mbox function for refreshing Mod Mbox archives
daomcgill Sep 22, 2024
0dc6001
i #284 Updated Notebook
daomcgill Oct 1, 2024
f0027dc
i #284 Testing Github Actions
daomcgill Oct 2, 2024
9b9c896
i #284 Renamed save_folder_mail parameter to mbox
daomcgill Oct 2, 2024
7249c9b
i #284 Updated Notebook download_mail.Rmd
daomcgill Oct 3, 2024
2a1ba98
Revert "i #284 Testing Github Actions"
daomcgill Oct 3, 2024
7bf8ba6
i #284 Refactored parse_mbox_latest_date and Fixed Roxygen Errors
daomcgill Oct 3, 2024
aa60648
i #284 Update NEWS.md
daomcgill Oct 3, 2024
64e0646
i #284 Updated Notebook, exec/mailinglist.R and R/mail.R
daomcgill Oct 6, 2024
2b6a963
i #284 Changed Notebook to Use Project Working Directory
daomcgill Oct 6, 2024
dc40dba
i #284 Minor Fix: Folder Paths in helix.yml
daomcgill Oct 6, 2024
d6f3b41
i #284 fixes incorrect call
carlosparadis Oct 9, 2024
f02ecb1
i #284 attempt fix on Actions
carlosparadis Oct 9, 2024
7f38d1c
i #284 incomplete storytelling review
carlosparadis Oct 9, 2024
309fa34
i #284 downgrade version of R for XML
carlosparadis Oct 9, 2024
e04bd31
i #284 gcc not found on Actions
carlosparadis Oct 9, 2024
dbd7092
i #284 Refactored download_mail.Rmd
beydlern Oct 10, 2024
ea109bd
Merge branch 'master' into 284-mbox-download-refresher
daomcgill Oct 10, 2024
c4b9d16
i #284 Testing GitHub Actions after Merge
daomcgill Oct 10, 2024
90b05ed
i #284 GH Actions (changed perceval path)
daomcgill Oct 10, 2024
3e5f8f7
i #284 Change Roxygen version
daomcgill Oct 10, 2024
4af2c21
i #284 Update Notebook and config file
daomcgill Oct 11, 2024
8094402
i #284 Final Updates for Mail Notebook
daomcgill Oct 15, 2024
5fb3af7
i #284 Fixed Relative Paths in a Notebook
beydlern Oct 18, 2024
e56848a
i #230 create config file interface
anthonyjlau Nov 12, 2024
b462ddb
Merge branch 'master' into 284-mbox-download-refresher
carlosparadis Nov 12, 2024
def1660
i #284 minor fixes and XML dependency
carlosparadis Nov 12, 2024
bfc75cb
revert utags
carlosparadis Nov 12, 2024
c1830f6
i #284 More narrative and config fixes
carlosparadis Nov 12, 2024
4842100
i #284 Remove description tags
carlosparadis Nov 12, 2024
0f9769e
i #284 more minor doc formatting fixes
carlosparadis Nov 12, 2024
6f6a59b
i #284 Updates to exec/mailinglist.R and Minor Fixes for Mail Configu…
daomcgill Nov 12, 2024
775b5a6
Merge branch 'master' of https://github.com/sailuh/kaiaulu
daomcgill Nov 12, 2024
93f214e
Merge branch 'master' into 284-mbox-download-refresher
daomcgill Nov 12, 2024
e27a604
i #295 Change argument for exec from 'tabulate' to 'parse'
daomcgill Nov 12, 2024
6a5fed6
i #284 Testing Fix for Actions
daomcgill Nov 13, 2024
ffb5c9c
i #284 Try Adding Debugging
daomcgill Nov 13, 2024
e55b6e2
Revert "i #284 Try Adding Debugging"
daomcgill Nov 13, 2024
c797219
i #284 Revert ctags version
daomcgill Nov 13, 2024
092e2ab
Update commit_message_id_coverage.Rd
daomcgill Nov 13, 2024
56dff9c
i #284 Please work
daomcgill Nov 13, 2024
fd97af0
i #295 Last try
daomcgill Nov 13, 2024
8709b95
Revert "i #295 Last try"
daomcgill Nov 13, 2024
71054f9
Revert "i #284 Please work"
daomcgill Nov 13, 2024
382383d
Revert "Update commit_message_id_coverage.Rd"
daomcgill Nov 13, 2024
f11e452
Revert "i #284 Revert ctags version"
daomcgill Nov 13, 2024
09d00c3
Reapply "i #284 Try Adding Debugging"
daomcgill Nov 13, 2024
216fe07
i #284 R version
daomcgill Nov 13, 2024
99823d7
i #284 another R version change attempt
daomcgill Nov 13, 2024
6cd5e11
i #284 Version that was passing check
daomcgill Nov 13, 2024
132355d
i #295 Small changes from updated config
daomcgill Nov 30, 2024
dceded0
i #284 Updates to Mail Notebook
daomcgill Dec 1, 2024
5515d7c
i #284 Update Mailing List Exec to use "refresh"
daomcgill Dec 6, 2024
a89b983
Reverse github actions to match master
carlosparadis Dec 8, 2024
4aa2af2
Remove git.R print statements
carlosparadis Dec 8, 2024
a366573
Remove prints from mail tests
carlosparadis Dec 8, 2024
2887232
Remove prefix underline
carlosparadis Dec 8, 2024
800fccc
Remove additional git prints
carlosparadis Dec 8, 2024
67de9f8
Internal api functions should not be display
carlosparadis Dec 8, 2024
ae1ba66
Unit tests now pass locally
carlosparadis Dec 8, 2024
3697fe3
Remove more prints..
carlosparadis Dec 8, 2024
7fc9e41
Remove strange mbox file path
carlosparadis Dec 8, 2024
207d0c4
Fix parse_mbox removing stderr = TRUE
carlosparadis Dec 8, 2024
d3dd232
Add loop to parse_mbox on notebook
carlosparadis Dec 8, 2024
f74aff3
Documentation pass
carlosparadis Dec 8, 2024
557ad10
i #284 Update refresh functions
daomcgill Dec 8, 2024
1cf86e5
i #284 Missing file in previous commit
daomcgill Dec 8, 2024
f3048a9
i #284 Edit exec/mailinglist.R parse to take file as arg
daomcgill Dec 9, 2024
5ce5830
i #284 Use pipermail path for parsing pipermail folder
daomcgill Dec 9, 2024
41f0850
Merge branch 'master' into 284-mbox-download-refresher
daomcgill Dec 9, 2024
4df52d9
i #284 Minor fixes
daomcgill Dec 9, 2024
3 changes: 2 additions & 1 deletion DESCRIPTION
Original file line number Diff line number Diff line change
@@ -20,7 +20,8 @@ Authors@R: c(
person('Nicole', 'Hoess', role = c('ctb')),
person('Anthony', 'Lau', role = c('ctb')),
person('Sean', 'Sunoo', role = c('ctb')),
-person('Ian Jaymes', 'Iwata', role= c('ctb')),
+person('Ian Jaymes', 'Iwata', role = c('ctb')),
+person('Dao', 'McGill', role = c('ctb')),
person('Nicholas', 'Beydler', role = c('ctb')),
person('Mark', 'Burgess', role = c('ctb'))
)
6 changes: 4 additions & 2 deletions NAMESPACE
@@ -5,7 +5,6 @@ export(assign_exact_identity)
export(bipartite_graph_projection)
export(commit_message_id_coverage)
export(community_oslom)
-export(convert_pipermail_to_mbox)
export(dependencies_to_sdsmj)
export(download_bugzilla_perceval_rest_issue_comments)
export(download_bugzilla_perceval_traditional_issue_comments)
@@ -16,7 +15,6 @@ export(download_jira_issues)
export(download_jira_issues_by_date)
export(download_jira_issues_by_issue_key)
export(download_mod_mbox)
-export(download_mod_mbox_per_month)
export(download_pipermail)
export(dv8_clsxb_to_clsxj)
export(dv8_clsxj_to_clsxb)
@@ -184,17 +182,21 @@ export(parse_jira_rss_xml)
export(parse_line_metrics)
export(parse_line_type_file)
export(parse_mbox)
+export(parse_mbox_latest_date)
export(parse_nvdfeed)
export(parse_r_dependencies)
export(parse_r_function_definition)
export(parse_r_function_dependencies)
export(parse_rfile_ast)
+export(process_gz_to_mbox_in_folder)
export(query_src_text)
export(query_src_text_class_names)
export(query_src_text_namespace)
export(read_temporary_file)
export(recolor_network_by_community)
export(refresh_jira_issues)
+export(refresh_mod_mbox)
+export(refresh_pipermail)
export(smell_missing_links)
export(smell_organizational_silo)
export(smell_radio_silence)
9 changes: 8 additions & 1 deletion NEWS.md
@@ -3,6 +3,11 @@ __kaiaulu 0.0.0.9700 (in development)__

### NEW FEATURES

+* Refactor of all R/mail.R mailing list functions for downloading and refreshing both pipermail and mod mbox archives. [#284](https://github.com/sailuh/kaiaulu/issues/284)
+* `refresh_mod_mbox()` and `refresh_pipermail()` have been added. Both functions download mbox files that have not already been downloaded, up to the current year and month. [#284](https://github.com/sailuh/kaiaulu/issues/284)
+* `parse_mbox_latest_date()` has been added. This function returns the file name of the downloaded mbox file containing the latest date, for use by `download_mbox_per_month()` and `download_pipermail()` to implement a refresh capability. [#284](https://github.com/sailuh/kaiaulu/issues/284)
+* `refresh_jira_issues()` has been added. It is a wrapper function for the previous downloader and downloads only issues greater than the greatest key already downloaded.
+* `download_jira_issues()`, `download_jira_issues_by_issue_key()`, and `download_jira_issues_by_date()` have been added. They allow downloading Jira issues without JirAgileR [#275](https://github.com/sailuh/kaiaulu/issues/275) and support specifying issue Id and created ranges. They also interact with `parse_jira_latest_date` to implement a refresh capability.
* The GitHub API has been expanded to use refresh, along with other functions. `github_api_project_issue_search` has been added that makes the search/issues endpoint API calls. `github_api_project_issue_or_pr_comments_by_date` and `github_api_project_issue_by_date` have been added to download issue data and comments by date ranges. `github_parse_search_issues_refresh` has been added that parses the issue data downloaded from the search endpoint in the refresh_issues folder. `github_api_project_issue_refresh` and `github_api_project_issue_or_pr_comment_refresh` were added to download issue data or comments respectively that have not already been downloaded. `format_created_at_from_file` was added to retrieve the greatest date from a JSON file. See the Reference Docs on GitHub section for more details. [#282](https://github.com/sailuh/kaiaulu/issues/282)
* `config.R` now contains a set of getter functions used to centralize the gathering of configuration data and these getter functions are used to refactor configuration file information gathering. For example, loading configuration file information with variable assignment is as follows `git_repo_path <- config_file[["version_control"]][["log"]]` but refactoring with a config.R getter function becomes `git_repo_path <- get_git_repo_path(config_file)`. [#230](https://github.com/sailuh/kaiaulu/issues/230)
* `refresh_jira_issues()` has been added. It is a wrapper function for the previous downloader and downloads only issues greater than the greatest key already downloaded. [#275](https://github.com/sailuh/kaiaulu/issues/275)
@@ -30,7 +35,9 @@ __kaiaulu 0.0.0.9700 (in development)__

### MINOR IMPROVEMENTS

-* Issue #275, when introducing the concept of refresh on JIRA, affected some notebooks that still relied on data in that format. This issue change either notebook or config file to conform to the new JIRA downloader [#312](https://github.com/sailuh/kaiaulu/issues/312)
+* `parse_mbox_latest_date()` now uses the new mbox naming convention for the latest date. [#284](https://github.com/sailuh/kaiaulu/issues/284)
+* All mailing list documentation can now be found in `download_mail.Rmd`. [#284](https://github.com/sailuh/kaiaulu/issues/284)
+* `download_pipermail()` now downloads all the txt and txt.gz files in the accessed pipermail archive as mbox files. [#284](https://github.com/sailuh/kaiaulu/issues/284)
* The line metrics notebook now provides further guidance on adjusting the snapshot and filtering.
* The R File and R Function parser can now properly parse R folders which contain folders within (not following R package structure). Both `.r` and `.R` files are also now captured (previously only one of the two were specified, but R accepts both). [#235](https://github.com/sailuh/kaiaulu/issues/235)
* Refactor GoF Notebook in Graph GoF and Text GoF Notebooks [#224](https://github.com/sailuh/kaiaulu/issues/224)
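
Note on the refresh entries in the NEWS.md diff above: the idea of "download only what is missing, up to the current year and month" can be sketched roughly as below. This is a minimal illustration and not the code in R/mail.R. The helper names `latest_year_month()` and `months_to_refresh()`, and the `<list>_YYYYMM.mbox` file naming pattern, are assumptions made for the example. The sketch also assumes the latest month already on disk is fetched again, since it may have been incomplete when first downloaded.

```r
# Hypothetical helper (not part of kaiaulu): find the newest YYYYMM stamp
# among file names that follow the assumed pattern "<list>_YYYYMM.mbox".
latest_year_month <- function(file_names) {
  # regmatches() drops names that do not match, so unrelated files are ignored.
  stamps <- regmatches(
    file_names,
    regexpr("[0-9]{6}(?=\\.mbox$)", file_names, perl = TRUE)
  )
  if (length(stamps) == 0) {
    return(NA_character_)
  }
  # YYYYMM strings sort chronologically when compared as text.
  max(stamps)
}

# Hypothetical helper (not part of kaiaulu): list every month from the
# latest stamp on disk through the month of `today`, inclusive on both ends.
months_to_refresh <- function(latest, today = Sys.Date()) {
  start <- as.Date(paste0(latest, "01"), format = "%Y%m%d")
  end <- as.Date(format(today, "%Y-%m-01"))
  if (start > end) {
    return(character(0))
  }
  format(seq(start, end, by = "month"), "%Y%m")
}

# Usage with made-up file names and a fixed "today" for reproducibility.
files <- c("list_202401.mbox", "list_202403.mbox", "notes.txt")
latest <- latest_year_month(files)
months_to_refresh(latest, today = as.Date("2024-06-15"))
# returns c("202403", "202404", "202405", "202406")
```

A real refresher would then loop over the returned months and call the appropriate downloader for each one; that step is left out here because the downloader signatures are not shown in this diff.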
2 changes: 1 addition & 1 deletion R/example.R
@@ -550,7 +550,7 @@ example_jira_issue_comments <- function(folder_path = "/tmp", folder_name) {
example_mailing_list_two_threads <- function(folder_path = "/tmp", folder_name, file_name) {

# Create folder & repo
-folder_path <- io_make_folder(folder_path=folder_path, folder_name = folder_name)
+folder_path <- io_make_folder(folder_path = folder_path, folder_name = folder_name)

# Step 1: Create fake mbox replies and assign them to variables for easy editing
thread_1_reply_1 <- make_mbox_reply(mailing_list="test-list",