Question-answering system for blog post pages from the website: improvado.io

CiriloRibeiro/QABlogPosts

QABlogPosts

Question-answering system for blog post pages from the website: improvado.io

How does it work?

This Colab notebook scrapes improvado.io/blog, collects the titles and contents of most blog posts, and dumps them to a JSON file.
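The scraping step could look roughly like the sketch below. It is not the notebook's actual code: it assumes each post page has its title in the first `<h1>` and its body in `<p>` tags, and the names `PostParser`, `parse_post`, `fetch` and `dump_posts`, as well as the `title`/`content` JSON keys, are hypothetical. Only the standard library is used.

```python
import json
from html.parser import HTMLParser
from urllib.request import urlopen


class PostParser(HTMLParser):
    """Collects the first <h1> as the title and every <p> as body text.

    Assumption: the real blog markup may need different tags or selectors.
    """

    def __init__(self):
        super().__init__()
        self._tag = None          # tag currently being captured ("h1" or "p")
        self.title = ""
        self.paragraphs = []

    def handle_starttag(self, tag, attrs):
        if tag == "h1" and not self.title:
            self._tag = "h1"
        elif tag == "p":
            self._tag = "p"
            self.paragraphs.append("")

    def handle_endtag(self, tag):
        if tag == self._tag:
            self._tag = None

    def handle_data(self, data):
        # Text inside nested tags (e.g. <a>) is kept as part of the parent.
        if self._tag == "h1":
            self.title += data
        elif self._tag == "p":
            self.paragraphs[-1] += data


def parse_post(html):
    """Turn one post's HTML into a {"title", "content"} record."""
    parser = PostParser()
    parser.feed(html)
    content = " ".join(p.strip() for p in parser.paragraphs if p.strip())
    return {"title": parser.title.strip(), "content": content}


def fetch(url):
    """Download a page as text (assumes UTF-8 encoding)."""
    with urlopen(url) as response:
        return response.read().decode("utf-8")


def dump_posts(posts, path):
    """Write the list of post records to a JSON file."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump(posts, f, ensure_ascii=False, indent=2)
```

With a list of post URLs in hand, something like `dump_posts([parse_post(fetch(u)) for u in urls], "posts.json")` would produce the JSON file.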

Next, the pre-trained QA model deepset/roberta-base-squad2 (https://huggingface.co/deepset/roberta-base-squad2) generates the answers, taking each title (treated as the question) and the corresponding content (treated as the context) as inputs.
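A minimal sketch of this step, assuming the scraped JSON is a list of records with `title` and `content` keys (a hypothetical layout) and that the `transformers` library is installed. The helper names `load_qa` and `answer_posts` are illustrative and not taken from the notebook.

```python
def load_qa(model_name="deepset/roberta-base-squad2"):
    """Build a Hugging Face question-answering pipeline for the given model."""
    from transformers import pipeline  # imported lazily; downloads the model on first use
    return pipeline("question-answering", model=model_name)


def answer_posts(posts, qa):
    """Ask each post's title as a question against that post's content.

    `qa` is any callable accepting question= and context= and returning a
    dict with "answer" and "score", which is what the pipeline returns.
    """
    results = []
    for post in posts:
        if not post["content"]:
            continue  # nothing to extract an answer from
        out = qa(question=post["title"], context=post["content"])
        results.append(
            {"question": post["title"], "answer": out["answer"], "score": out["score"]}
        )
    return results
```

Calling `answer_posts(posts, load_qa())` would then run the model over every scraped post. Passing the model in as an argument keeps the loop easy to try out with a stand-in callable.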

Only 21 questions were generated, as a first validation of the output.

Some of the answers are reasonable, while others are poor, for example:

When asked: How to Build an Influencer Marketing Dashboard (+ 8 metrics to track)

Answer: Marketing teams can track all essential metrics and ROI of campaigns

When asked: The Top 7 Marketing Reporting Software Solutions for 2023

Answer: WhatagraphWhatagraph

Next improvements

The next step is to compute key metrics such as Exact Match and F1 score to validate the model. A further step is to use the web-scraped data to train a new QA model.
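For reference, Exact Match and F1 could be computed in the SQuAD style as sketched below. This assumes a reference answer is available for each question, which the scraped data does not currently include. The function names are illustrative.

```python
import re
import string
from collections import Counter


def normalize(text):
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction, reference):
    """1 if the normalized strings are identical, else 0."""
    return int(normalize(prediction) == normalize(reference))


def f1_score(prediction, reference):
    """Token-level F1 between the normalized prediction and reference."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    if not pred_tokens or not ref_tokens:
        # If either side is empty, score 1 only when both are empty.
        return float(pred_tokens == ref_tokens)
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

Averaging both scores over all question/answer pairs gives the dataset-level numbers.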
