Hey Daniel @mrdbourke,
I came across another research paper (https://arxiv.org/pdf/1909.04054.pdf) that achieves better results on PubMed using BERT. I read through the paper and want to implement the BERT+Transformer+CRF approach, which reaches an F1 score of 92.1%. I have a few doubts about Section 4 (Training and Implementation). Can you help me with this?
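For the CRF part of that approach, one way to prototype it is with the third-party `pytorch-crf` package. The snippet below is a minimal sketch under that assumption; the package choice, shapes, and label count are illustrative, not necessarily what the paper's authors used:

```python
# Minimal sketch of CRF decoding over per-sentence logits, assuming the
# third-party pytorch-crf package (pip install pytorch-crf). The paper's
# actual implementation may differ; shapes/labels are for illustration.
import torch
from torchcrf import CRF

num_labels = 5   # e.g. BACKGROUND/OBJECTIVE/METHODS/RESULTS/CONCLUSIONS
crf = CRF(num_labels, batch_first=True)

# "emissions" stands in for the per-sentence logits produced by a
# BERT+Transformer encoder: (batch, n_sentences, num_labels).
emissions = torch.randn(2, 7, num_labels)
tags = torch.randint(0, num_labels, (2, 7))   # gold label per sentence

loss = -crf(emissions, tags)        # negative log-likelihood for training
best_paths = crf.decode(emissions)  # Viterbi-decoded label sequences
```

The CRF adds a learned transition score between adjacent sentence labels, which is why it helps on abstracts where section labels follow a predictable order.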
Replies: 1 comment

Hey @Rahul1758, it looks like they use BERT to represent all the sentences in a document as one long sequence of words. As for the model architecture, they share their code and data on GitHub: https://github.com/allenai/sequential_sentence_classification. I'd check that out to see how they added a transformer layer on top of the BERT representations.
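To make that idea concrete, here's a rough sketch in PyTorch with Hugging Face `transformers`: the whole document goes through BERT as one sequence, the hidden state at each sentence's [SEP] token serves as that sentence's representation, and one extra transformer layer runs over those vectors. This is an illustration of the idea, not the authors' code; the repo above is the reference implementation.

```python
# Illustrative sketch only (PyTorch + Hugging Face transformers): encode all
# sentences of a document as one BERT sequence, take the hidden state at each
# sentence's [SEP] token as that sentence's representation, then run one more
# transformer layer over those per-sentence vectors before classifying.
import torch
import torch.nn as nn
from transformers import AutoModel

class DocSentenceClassifier(nn.Module):
    def __init__(self, num_labels, model_name="bert-base-uncased"):
        super().__init__()
        self.bert = AutoModel.from_pretrained(model_name)
        hidden = self.bert.config.hidden_size
        # Extra transformer layer over the per-sentence representations.
        self.sentence_layer = nn.TransformerEncoderLayer(
            d_model=hidden, nhead=8, batch_first=True
        )
        self.classifier = nn.Linear(hidden, num_labels)

    def forward(self, input_ids, attention_mask, sep_positions):
        # input_ids: (batch, seq_len) -- a whole abstract joined into one
        # sequence, with a [SEP] token closing every sentence.
        # sep_positions: (batch, n_sentences) -- index of each [SEP] token.
        tokens = self.bert(input_ids=input_ids,
                           attention_mask=attention_mask).last_hidden_state
        batch_idx = torch.arange(input_ids.size(0)).unsqueeze(-1)
        sents = tokens[batch_idx, sep_positions]  # (batch, n_sentences, hidden)
        sents = self.sentence_layer(sents)
        return self.classifier(sents)             # per-sentence label logits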
```