-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Review metadata for new Canons 6 corpora #20
Comments
It is so great to see all this new Coptic! I have looked only at shenoute.throne so far. For @somiyagawa and @amir-zeldes: I have two big questions and then need information for some metadata fields. Probably you will want to answer the questions first, since metadata depends on the answers:
Then once those questions are answered, I need info for each of the following metadata fields. Please add them to each file in GitDox. For most of the items, I do not know the information, so I cannot do it myself. An explanation of all the fields is on our wiki here (scroll down):
Many thanks! |
Again really looking forward to seeing this all published, and very grateful to @somiyagawa, Julien, and the other folks in Goettingen!!! |
The document splits are my fault - as you know, they need to be smaller than the entire contiguous section, since that will cause bad latency in ANNIS otherwise. I just split them into NBFB-sized chunks (that's always seemed to work, I think we did the same for Eagerness), and it happened to cut across R/V. So just to verify before I change them: would it be OK to do:
? @somiyagawa : this means I'll have to ask you to look at the translation boundaries for the first two documents again, sorry for the mix up! |
Thanks for looking at this Amir! There are a few things going on, so please don’t break the docs up quite yet.
As I mentioned upthread, we also need to break them based on contiguity in the modern repository. Someone needs to check and see where these mss live, document their call numbers, etc., and then break the documents based on how the mss are fragmented.
If this is all one continuous fragment sitting on a shelf in one place with one call #, then we could do what you propose. But first So or Julien need to look up this information. Then we can break the text into the proper document sizes and add the correct metadata.
Thanks! We are very close. Just a few last details.
…Sent from my iPad
On Jul 3, 2018, at 7:17 AM, Amir Zeldes ***@***.***> wrote:
The document splits are my fault - as you know, they need to be smaller than the entire contiguous section, since that will cause bad latency in ANNIS otherwise. I just split them into NBFB-sized chunks (that's always seemed to work, I think we did the same for Eagerness), and it happened to cut across R/V.
So just to verify before I change them: would it be OK to do:
1-10
11-20
21-28
47
?
@somiyagawa : this means I'll have to ask you to look at the translation boundaries for the first two documents again, sorry for the mix up!
—
You are receiving this because you were assigned.
Reply to this email directly, view it on GitHub, or mute the thread.
|
OK, sure, just let me know what splits you want and I can implement those, with the caveat that I may need to makes some added splits due to processing concerns (but I'll never need to merge two things that don't belong together). |
Ok. We need to hear from So.
Sent from my iPad.
|
Hey folks, I'm separating out the OT into a separate thread. #21 |
I have looked at shenoute.obliged. The same metadata and document division questions from shenoute.throne apply to shenoute.obliged. |
Also if there are any manuscript parallels in shenoute.obliged, those need to be accounted for in the document structure. |
Need urns and metadata review in general for new corpora:
sahidic.ot (CoptOT base text version)see Coptic OT for fall 2018 publication #21Note that CoptOT translation has been switched to Brenton.
The text was updated successfully, but these errors were encountered: