MINT - Implement GPU accelerated nvJPEG encoder #20

jestrada-atlassian · 2025-03-16T22:34:38Z

Adding a GPU accelerated jpeg encoder. This will allow us to accelerate parts of our transcoder thumbnail job that we previously have been unable to run on GPU.

https://www.loom.com/share/9ee806965ce244189b3de057b82dcae7

w3sip

At least one critical problem that I see (memory leak), but looks good otherwise.

w3sip · 2025-03-17T14:39:39Z

libavcodec/nvjpegenc.c

+    }
+
+    // Handle YUV input formats and populate nv_image data
+    if (sw_format == AV_PIX_FMT_YUV420P || sw_format == AV_PIX_FMT_YUVJ420P) {


Why do we want three different conditions / implementations here? Seems the same.

Its a leftover from trying to get NV12 to work. Unfortunately couldn't get it to work. Will consolidate.

w3sip · 2025-03-17T14:45:31Z

libavcodec/nvjpegenc.c

+    }
+
+    // Retrieve the bitstream again to populate output buffer
+    CHECK_NVJPEG(nvjpegEncodeRetrieveBitstream(


This would leak out_buf

w3sip · 2025-03-17T14:46:07Z

libavcodec/nvjpegenc.c

+    NvjpegContext* ctx = avctx->priv_data;
+
+    if (ctx->encoder_state) {
+        CHECK_NVJPEG(nvjpegEncoderStateDestroy(ctx->encoder_state));


Should we continue with cleanup even if one of these fails?

Should be safe for now. We'll emit a log if this does happen but once we get to this point we'd have a valid output so I don't think we should throw an error even if this fails. We should still try to destroy the rest of the states if possible anyway.

But CHECK_NVJPEG would return on error, right?

You're right. Removed it from all these and just manually emitting a log

w3sip · 2025-03-17T14:48:52Z

libavcodec/nvjpegenc.c

+
+    ctx->width = avctx->width;
+    ctx->height = avctx->height;
+    if ((ret = nvjpeg_init(avctx)) < 0)


Personal nit: I usually try to avoid assignment and check of a condition in the same line. Depends on the codebase convention, though.
Another nit, I often prefer to have curly braces even on single-line scope. Again, depends on codebase convention.

I'm roughly following the ffmpeg code convention found here. This implementation doesn't follow it perfectly but I'll clean it up before submitting the patch to them,

https://ffmpeg.org/developer.html#Coding-Rules-1

jpujol · 2025-03-17T18:03:03Z

@jestrada-atlassian : do we expect a lot of gain here? even the software encoder for jpeg is extremely fast
EDITED: just saw your video

jpujol · 2025-03-17T18:06:52Z

is this something we can submit upstream? It would be nice to have it reviewed by the nvidia guys

jestrada-atlassian · 2025-03-17T18:28:33Z

is this something we can submit upstream? It would be nice to have it reviewed by the nvidia guys

I have this on my TODO list alongside getting pad_npp reviewed. Shooting for sometime mid quarter.

w3sip · 2025-03-22T00:53:46Z

libavcodec/nvjpegenc.c

+    }
+
+    // Retrieve the bitstream again to populate output buffer
+    ret = CHECK_NVJPEG(nvjpegEncodeRetrieveBitstream(


Remove CHECK_NVJPEG (still would return without freeing the buffer)

w3sip · 2025-03-22T00:55:40Z

libavcodec/nvjpegenc.c

+        nv_image.channel[2] = frame->data[2];
+        nv_image.pitch[2] = frame->linesize[2];
+    } else if (input_format != NVJPEG_INPUT_TYPE_RGB) {
+        // Handle BGR/RGB input formats


I may be misunderstanding, bit either the comment is wrong, or != is? Could be totally misreading this, though.

Simplified this. We're already checking the software format earlier so it doesn't make sense to check it here again

Could simplify it further with
int planes = (input_format == NVJPEG_INPUT_TYPE_YUV) ? 3 : NVJPEG_MAX_COMPONENT;
and then

for (i = 0; i < planes; i++) { nv_image.channel[i] = frame->data[i]; nv_image.pitch[i] = frame->linesize[i]; }

, up to you.

w3sip

Looks good. Couple of things to look at, but neither is a must.

w3sip · 2025-03-22T14:41:11Z

libavcodec/nvjpegenc.c

+        nv_image.channel[2] = frame->data[2];
+        nv_image.pitch[2] = frame->linesize[2];
+    } else if (input_format != NVJPEG_INPUT_TYPE_RGB) {
+        // Handle BGR/RGB input formats


Could simplify it further with
int planes = (input_format == NVJPEG_INPUT_TYPE_YUV) ? 3 : NVJPEG_MAX_COMPONENT;
and then

for (i = 0; i < planes; i++) { nv_image.channel[i] = frame->data[i]; nv_image.pitch[i] = frame->linesize[i]; }

, up to you.

w3sip · 2025-03-22T14:42:20Z

libavcodec/nvjpegenc.c

+        ctx->nvjpeg_handle, ctx->encoder_state, out_buf, &out_buf_size, NULL);
+
+    if (ret != NVJPEG_STATUS_SUCCESS) {
+    	av_free(out_buf);


Perhaps a log statement here?

jestrada-atlassian added 2 commits March 16, 2025 15:27

Add GPU acelerated nvJPEG encoder

3cbab59

Add GPU acelerated nvJPEG encoder

3cc17f7

jestrada-atlassian requested review from a team, sjhsieh, jpujol, w3sip, ayushgr, yiming-loom, ncooleyy and mchin3loom and removed request for a team March 16, 2025 22:34

w3sip requested changes Mar 17, 2025

View reviewed changes

Address feedback

8159cff

jestrada-atlassian requested a review from w3sip March 22, 2025 00:45

w3sip reviewed Mar 22, 2025

View reviewed changes

Address feedback and simplify

bd0c464

w3sip approved these changes Mar 22, 2025

View reviewed changes

Address feedback

ed7f31a

jestrada-atlassian merged commit d4fc3bd into n7.1.loom-patch3 Mar 22, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MINT - Implement GPU accelerated nvJPEG encoder #20

MINT - Implement GPU accelerated nvJPEG encoder #20

jestrada-atlassian commented Mar 16, 2025

w3sip left a comment

w3sip Mar 17, 2025

jestrada-atlassian Mar 22, 2025

w3sip Mar 17, 2025

w3sip Mar 17, 2025

jestrada-atlassian Mar 22, 2025

w3sip Mar 22, 2025

jestrada-atlassian Mar 22, 2025

w3sip Mar 17, 2025

jestrada-atlassian Mar 22, 2025

jpujol commented Mar 17, 2025 •

edited

Loading

jpujol commented Mar 17, 2025

jestrada-atlassian commented Mar 17, 2025

w3sip Mar 22, 2025

jestrada-atlassian Mar 22, 2025

w3sip Mar 22, 2025

jestrada-atlassian Mar 22, 2025

w3sip Mar 22, 2025

w3sip left a comment

w3sip Mar 22, 2025

w3sip Mar 22, 2025

MINT - Implement GPU accelerated nvJPEG encoder #20

MINT - Implement GPU accelerated nvJPEG encoder #20

Conversation

jestrada-atlassian commented Mar 16, 2025

w3sip left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jpujol commented Mar 17, 2025 • edited Loading

jpujol commented Mar 17, 2025

jestrada-atlassian commented Mar 17, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

w3sip left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jpujol commented Mar 17, 2025 •

edited

Loading