Skip to content

Analyze and score conversation transcripts with AWS integrations. Transforms raw chat data into insightful metrics, offering a comprehensive view of conversation quality. Features scoring algorithms, data processing pipelines, and AWS Glue ETL processes.

Notifications You must be signed in to change notification settings

traksb/ConvoQualityEvaluator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 

Repository files navigation

```markdown

Conversation Quality Evaluation with AWS and GPT Models

This repository provides a solution to analyze conversation quality using a combination of AWS S3 storage, PySpark data processing, and OpenAI's GPT models. The conversations are fetched from AWS S3, processed, and then evaluated using a pre-defined scoring mechanism with the aid of a GPT model.

Features

  • Session-based Message Extraction: Filter and organize messages by session IDs.
  • Conversation Fetching from S3: Extract and preprocess conversation data directly from S3 URIs.
  • Transcript Generation: Convert conversations into a readable transcript format.
  • GPT-based Scoring: Use a GPT model to calculate the score for a conversation's answer based on a system message.
  • AWS Glue ETL Integration: Integrate with AWS Glue for ETL processes.

Getting Started

Prerequisites

  • Python 3.x
  • AWS Account with access to S3
  • boto3
  • PySpark
  • OpenAI's GPT model access

Installation

  1. Clone the repo: ```bash git clone [your-repo-url] ```
  2. Navigate to the directory: ```bash cd [repo-name] ```
  3. Install the required packages: ```bash pip install -r requirements.txt ```

Usage

  1. Configure AWS credentials and specify the relevant S3 URI paths within the code.
  2. Adjust system_message, client information, and model name as necessary.
  3. Run the script. It will process the conversation data, score it, and start an AWS Glue ETL process (you'll need to complete the function for this).

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

License

This project is licensed under the MIT License.

Acknowledgments

  • OpenAI for GPT models.
  • AWS for cloud and data services.

About

Analyze and score conversation transcripts with AWS integrations. Transforms raw chat data into insightful metrics, offering a comprehensive view of conversation quality. Features scoring algorithms, data processing pipelines, and AWS Glue ETL processes.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages