Skip to content

Commit

Permalink
Merge branch 'main' into fall2024
Browse files Browse the repository at this point in the history
  • Loading branch information
DaltonAlves committed Feb 11, 2025
2 parents e22e9c9 + 61bcd35 commit 1a8499a
Show file tree
Hide file tree
Showing 2 changed files with 8 additions and 4 deletions.
10 changes: 7 additions & 3 deletions webarchives/notes.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,7 @@ title: "Web Archives - Notes"
permalink: /webarchives/notes/
parent: GW Web Archives
---
# Helpful Resources
## Helpful Resources
- [Known Platform Issues ](https://support.archive-it.org/hc/en-us/articles/9897233696148-Social-media-and-other-platforms-status)
- A selection of platforms that the Archive-It team monitors for changes in capture and replay.
- [Scoping Recommendations for Specific Sites](https://support.archive-it.org/hc/en-us/sections/201841373)
Expand All @@ -17,14 +17,18 @@ parent: GW Web Archives
- Archiving Tableau
- [Archive-It Blog Post on use of Youtube-Dl in Archive-it Stack](https://archive-it.org/post/the-stack-youtube-dl-guide/)

# General Notes:
## Helpful Resources for Web Admins/Website Owners
- [Creating Preservable Websites(Library of Congress)](https://www.loc.gov/programs/web-archiving/for-site-owners/creating-preservable-websites/)
- [Web Archivability (Nicholas Taylor)](https://nullhandle.org/web-archivability/index.html)

## General Notes:
- Brozzler crawls can not be scheduled. This is problematic as more and more of our regularly crawled sites require Brozzler.
- Expanding Crawl to Accept Vimeo Videos
- Add the following seed scope rules:
- Ignore Robots.txt
- Expand Scope to include URL if it matches the SURT: http://(com,vimeocdn

# Seed/Crawl Notes
## Seed/Crawl Notes
- [President Granberg Inauguration](https://inauguration.gwu.edu/) (no longer active)
- Embedded media not compliant with youtube-dl
- GW Law Course Catalog
Expand Down
2 changes: 1 addition & 1 deletion webarchives/webmetadata.md
Original file line number Diff line number Diff line change
Expand Up @@ -56,7 +56,7 @@ Metadata should be applied to seeds in Archive-It when they are created. It is *
| **Required** | Explanation |
| ---------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Definition | Organization responsible for collecting the archived content. |
| Example Usage for SCRC | - Special Collections Research Center, The George Washington University<br> - George Washington University Libraries |
| Example Usage for SCRC | - Special Collections Research Center. The George Washington University<br> - George Washington University Libraries |
| Standards | DACS 2.2 |
| Guidance | Identify the institution responsible for selecting websites for archiving, crawling the websites, and creating and maintaining the metadata that describes the content |
| Note | Use SCRC when content falls under SCRC collecting scope. Use GW Libraries when content falls outside of SCRC collecting scope.
Expand Down

0 comments on commit 1a8499a

Please sign in to comment.