Disentangle Content-Encoding, Transfer-Encoding, character set conversion and chunked transfers. #14

mworrell · 2013-12-27T21:16:57Z

In Webmachine there is a bit confusion between Content-Encoding and Transfer-Encoding.

The Content-Encoding is applied by functions, though also on the chunks. This doesn't play well with gzip, as that should be applied to the whole entity and not to its chunks.

The idea behind Content-Encoding is that the server has multiple versions of an entity, and can select which version should be served. That means that the server has already a prepared compressed version of the data. The Range is also applied on this (compressed) version of the data.

The Transfer-Encoding is applied after the fetching the correct ranges and can consists of chunking or (further) compression. There can be multiple Transfer-Encodings, they are given in the header in the order that they were applied. For example, first chunking and then gzip will give:

Transfer-Encoding: chunked, gzip

This plays well with different content encodings.

In this way there will be a clear distinction: the controller provides the content and Webzmachine might add transfer encodings.

On a similar note, the character set is almost always UTF-8 and should be supplied by the content, i.e. not changed by Webzmachine.

I propose to add two new callbacks, remove one, and change another:

New:

content_encodings_provided/2
transfer_encodings_provided/2

Remove:

encodings_provided/2

Change return format of:

charsets_provided/2

All three will return a list of encodings/charsets, instead of tuples with the encoding and re-code functions.

The content_types_provided/2 function should then take the selected charset and encoding to select the correct content function (or the content function can handle that).

The selected encodings are available from:

webmachine_request:get_metadata('content-encoding', ReqData)
webmachine_request:get_metadata('chosen-charset', ReqData)
webmachine_request:get_metadata('content-type', ReqData)

The text was updated successfully, but these errors were encountered:

mworrell · 2013-12-27T21:17:28Z

/cc @mmzeeman @arjan @kaos

mmzeeman · 2013-12-28T22:53:31Z

Nice one :-) Some parts of webmachine desperately need a cleanup. This will make things more clear in this area.

Side note: There is a separate header for negotiating which transfer encoding to use. This is the TE header. Browsers generally don't send this header. This means that in the normal case identity and chunked are allowed, not gzip. Even if that is specified in the Accept-Encoding header. The Accept-Encoding header is restricted to specifying content encoding only. Proxies may add an TE header of course, but I don't think that is done in practice.

kaos · 2013-12-29T07:20:25Z

👍

…s for provided charsets and encodings. See zotonic/webzmachine#14

mworrell · 2014-01-15T10:15:33Z

The selected content-encoding/charset can be fetched with:

wrq:resp_content_encoding(ReqData).
wrq:resp_chosen_charset(ReqData).

There are some more new wrq functions:

resp_transfer_encoding/1, set_resp_transfer_encoding/2,
resp_content_encoding/1, set_resp_content_encoding/2,
resp_content_type/1, set_resp_content_type/2,
resp_chosen_charset/1, set_resp_chosen_charset/2,

See wrq.erl for more details.

Also added support for file:sendfile, and some new content return values, complete list is now:

iolist()
{device, IO}
{device, Length, IO}
{file, Filename}
{file Length, Filename}
{stream, StreamFun}
{stream, Size, StreamFun}
{writer, WriterFun}

mmzeeman · 2014-01-15T10:34:14Z

That looks nice. I will place the same controller workflow inside the new elli + machine solution.

…om Yaws. See also http://erlang.org/pipermail/erlang-bugs/2013-October/003818.html Issue #14

ghost assigned mworrell Dec 27, 2013

mworrell added a commit that referenced this issue Jan 15, 2014

Change return format for provided encodings and charsets. Issue #14

1d9adea

mworrell added a commit to zotonic/zotonic that referenced this issue Jan 15, 2014

core/modules: changes to controllers for new webzmachine return value…

984c7bd

…s for provided charsets and encodings. See zotonic/webzmachine#14

mworrell mentioned this issue Jan 15, 2014

Add support for sendfile #2

Closed

mworrell added a commit that referenced this issue Jan 16, 2014

Make sendfile behaviour switchable. Defaults to using the sendfile fr…

f31b966

…om Yaws. See also http://erlang.org/pipermail/erlang-bugs/2013-October/003818.html Issue #14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Disentangle Content-Encoding, Transfer-Encoding, character set conversion and chunked transfers. #14

Disentangle Content-Encoding, Transfer-Encoding, character set conversion and chunked transfers. #14

mworrell commented Dec 27, 2013

mworrell commented Dec 27, 2013

mmzeeman commented Dec 28, 2013

kaos commented Dec 29, 2013

mworrell commented Jan 15, 2014

mmzeeman commented Jan 15, 2014

Disentangle Content-Encoding, Transfer-Encoding, character set conversion and chunked transfers. #14

Disentangle Content-Encoding, Transfer-Encoding, character set conversion and chunked transfers. #14

Comments

mworrell commented Dec 27, 2013

mworrell commented Dec 27, 2013

mmzeeman commented Dec 28, 2013

kaos commented Dec 29, 2013

mworrell commented Jan 15, 2014

mmzeeman commented Jan 15, 2014