Add name #715

GuittenyMartin · 2025-09-22T07:35:52Z

add names in diarization and summary

Add a livekit agent to send metadata on s3
Add logic to map names to the right person
Add the formatting of the summary

Add a visible agent that joins the room and writes in a json all the active speaker informations.

Change the saving of the json metadata to the minio bucket_s3 to prepare the changes to map the name in the diarization

Add logic to map name to participant by aligning whisper output and metadata, calculating jaccard similarity for each speaker and participant Cleaning metadata extractor to pass ruff checks

get the name in the conversation and convert id to name in the metadata. Preference order: name attribute, metadata display_name, identity

Add metadata-agent container in compose to setup metadata agent dev Add run agents in makefile "make run" Add agent name in settings Setup livekit webhook in liverkit-server to start agent and recording simultaneously

Add the call of metadata agent when webhook from egress_started is received. Add dispatch id in recording model to be able to shutdown agent when recording end. Still need to make the transcription start after metadatas are uploaded on minio.

Change the formatting of the formatting to use the names instead of SPEAKER_00

Update start recording test because start_room_recording output changed.

Change FeatureFlag class to accept featureflag from posthog. add metadata_agent featuer flag to check if the feature flag is enabled for the creator of the recording. This feature flag will be used to allow some chosen users to test the named diarization feature.

Adding a condition on feature flag to get metadata in bucket Fixing the format_segments function to work without metadatas This is important because metadatas for names diarization will not be enabled for everyone at first

Fix the error created by trying to close agent when it is disabled. Make the agent hidden so users do not see it. It makes the diarization invisible for users.

Make the agent tolerant to no known owner In the tests we do not always have an owner so it is necessary to make the pipeline tolerant to that special case.

Add the possibility of getting egress manifest by sending the worker id in the notification. This will allow us to get the starting time of the recording to be able to align perfectly the metadatas and the transcription.

Copy the metadata agent from "Metadata agent #741" branch to have the VAD agent and the right output format for metadatas

Add pandas and numpy to use dataframe to calculate the speakers's names in diarization and improve performances.

Add livekit-pluging-silero for VAD agent in compose (it's not supposed to be started like this and will be changed)

format code with ruff to pass checks

Change the algorithm to map speakers to their name to use dataframe to improve performance.

sonarqubecloud · 2025-10-17T12:01:28Z

Quality Gate passed

Issues
7 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

Martin Guitteny added 3 commits September 19, 2025 17:32

🚧(agents) add metadata extractor agent

731bce9

Add a visible agent that joins the room and writes in a json all the active speaker informations.

🔨(agents) change json metadata dump to bucket_s3

0affc56

Change the saving of the json metadata to the minio bucket_s3 to prepare the changes to map the name in the diarization

🚧(summary) add logic to map name to participant + cleaning

214f41f

Add logic to map name to participant by aligning whisper output and metadata, calculating jaccard similarity for each speaker and participant Cleaning metadata extractor to pass ruff checks

GuittenyMartin linked an issue Sep 22, 2025 that may be closed by this pull request

Change in the transcription from SPEAKER to the names of the people in the video call. #486

Open

Martin Guitteny added 2 commits September 23, 2025 21:27

🚧(agents) convert id to name in metadatas

b4f6287

get the name in the conversation and convert id to name in the metadata. Preference order: name attribute, metadata display_name, identity

🔧(agents) add metadata agent in makefile compose and settings

9b78ab9

Add metadata-agent container in compose to setup metadata agent dev Add run agents in makefile "make run" Add agent name in settings Setup livekit webhook in liverkit-server to start agent and recording simultaneously

GuittenyMartin force-pushed the add-name branch 3 times, most recently from c83b1bd to d6cd005 Compare September 26, 2025 08:52

GuittenyMartin force-pushed the add-name branch from d6cd005 to e3267f9 Compare September 26, 2025 12:00

Martin Guitteny added 3 commits September 26, 2025 14:00

✨(summary) add the usage of metadata to use name in transcript

ab12080

Change the formatting of the formatting to use the names instead of SPEAKER_00

fixup! 🚧(agents) add metadata agent call on livekit webhook

cd4842e

✅(tests) update start recording test

34716b6

Update start recording test because start_room_recording output changed.

GuittenyMartin force-pushed the add-name branch from 255e9ca to 34716b6 Compare September 26, 2025 12:53

Martin Guitteny added 3 commits September 29, 2025 10:27

fixup! 🚧(agents) add metadata agent call on livekit webhook

4babe42

🐛(summary) fix transcription when metadata disabled

b4201e7

Adding a condition on feature flag to get metadata in bucket Fixing the format_segments function to work without metadatas This is important because metadatas for names diarization will not be enabled for everyone at first

GuittenyMartin force-pushed the add-name branch from a948779 to b4201e7 Compare October 3, 2025 16:15

fixup! 🚩(agents) add feature flag for metadata agent

64269c5

GuittenyMartin force-pushed the add-name branch from 31aef7a to 64269c5 Compare October 6, 2025 08:19

🐛(agents) hide agent and fix egress end when agent disabled

fd6f933

Fix the error created by trying to close agent when it is disabled. Make the agent hidden so users do not see it. It makes the diarization invisible for users.

GuittenyMartin force-pushed the add-name branch from 5547d2c to fd6f933 Compare October 6, 2025 09:09

🐛(agents) make agents tolerante to no owner

a02bea4

Make the agent tolerant to no known owner In the tests we do not always have an owner so it is necessary to make the pipeline tolerant to that special case.

GuittenyMartin force-pushed the add-name branch from 88f6c49 to a02bea4 Compare October 6, 2025 09:26

GuittenyMartin marked this pull request as ready for review October 6, 2025 09:34

Martin Guitteny added 4 commits October 10, 2025 11:26

🔨(summary) add egress manifest in summary

f7ba2cc

Add the possibility of getting egress manifest by sending the worker id in the notification. This will allow us to get the starting time of the recording to be able to align perfectly the metadatas and the transcription.

🎨(agents) change metadata agent

9e00ea6

Copy the metadata agent from "Metadata agent #741" branch to have the VAD agent and the right output format for metadatas

➕(summary) add numpy and pandas dependencies

561fe8d

Add pandas and numpy to use dataframe to calculate the speakers's names in diarization and improve performances.

➕(agents) add livekit-plugins in compose

82e7f2d

Add livekit-pluging-silero for VAD agent in compose (it's not supposed to be started like this and will be changed)

Martin Guitteny added 2 commits October 17, 2025 13:42

➕(agents) format code with ruff

67bc2ae

format code with ruff to pass checks

⚡️(summary) change name diarization to use dataframe

29ad484

Change the algorithm to map speakers to their name to use dataframe to improve performance.

GuittenyMartin force-pushed the add-name branch from d105d1c to 29ad484 Compare October 17, 2025 12:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add name #715

Add name #715

Uh oh!

GuittenyMartin commented Sep 22, 2025 •

edited

Loading

Uh oh!

sonarqubecloud bot commented Oct 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Add name #715

Are you sure you want to change the base?

Add name #715

Uh oh!

Conversation

GuittenyMartin commented Sep 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

add names in diarization and summary

Uh oh!

sonarqubecloud bot commented Oct 17, 2025

Quality Gate passed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

GuittenyMartin commented Sep 22, 2025 •

edited

Loading