Add support for other text formats in the file content filter #470

dpomykala · 2025-11-10T20:23:03Z

Change Summary

The filecontent filter now uses a specialized extractor function if registered for the given format (currently PDF and DOCX). For all other formats the extract_txt function is used as a fallback.

This allows to use the filecontent filter for all plain-text files, regardless of the file extension.

Related issue number

Resolves #464.

Checklist

Tests for the changes exist and pass on CI
Documentation reflects the changes where applicable
Change is documented in CHANGELOG.md (if applicable)
My PR is ready to review

Use a specialized extractor function if registered for the given format (currently PDF and DOCX). For all other formats use the `extract_txt` function as a fallback. Resolves tfeldmann#464.

Support other file formats in the filecontent filter

113bc8b

Use a specialized extractor function if registered for the given format (currently PDF and DOCX). For all other formats use the `extract_txt` function as a fallback. Resolves tfeldmann#464.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Add support for other text formats in the file content filter #470

Add support for other text formats in the file content filter #470

Uh oh!

dpomykala commented Nov 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Add support for other text formats in the file content filter #470

Are you sure you want to change the base?

Add support for other text formats in the file content filter #470

Uh oh!

Conversation

dpomykala commented Nov 10, 2025

Change Summary

Related issue number

Checklist

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant