+ For each high-level building block that decomposes into well-known
+ lower-level operations, the APIs will informatively define a generic
+ emulation path to allow for future extensibility.
+
It may also work on a higher-level API to load a custom pre-trained Machine Learning model for inference in the browser.
Motivation and Background
+
+ Computer Vision enables computers to gain understanding from images
+ or videos, Natural Language Processing enables interaction between
+ computers and human languages, and Speech Recognition enables
+ computers to recognize and translate spoken language into text.
+ Bringing these experiences to the web in a privacy-preserving manner
+ requires efficient machine learning inference capabilities built into
+ the browser.
+
Enabling Machine Learning inference in
the browser (as opposed e.g. to in the cloud) enhances privacy, since input
data such as locally sourced images or video streams stay within the
@@ -213,6 +222,11 @@
and retrieve the output when hardware execution completes.
+
+ This Working Group puts priority on building blocks required by
+ well-known model architectures in the fields of Computer Vision,
+ Natural Language Processing and Speech Recognition.
+
The APIs in scope of this group will not be tied to any particular
platform and will be implementable on top of existing major platform
From c06abcac911a105bd1eb7c0ec7eb3ef96fdd3607 Mon Sep 17 00:00:00 2001
From: Anssi Kostiainen
Date: Thu, 8 Dec 2022 11:57:54 +0200
Subject: [PATCH 08/11] Add OpenXLA Project to Coordination
Fix #28
---
charter.html | 12 ++++++++++++
1 file changed, 12 insertions(+)
diff --git a/charter.html b/charter.html
index 93ce058..ea1000f 100644
--- a/charter.html
+++ b/charter.html
@@ -498,6 +498,18 @@
ECMA TC39
TC39 defines the JavaScript language whose primitives are key in how WebNN access data (e.g. ArrayBuffer
). Possible work on operator overloading would also impact possible evolutions of the WebNN API.
+
+ - OpenXLA Project
+ -
+ OpenXLA Project develops StableHLO, a portable ML compute
+ operation set that makes frameworks easier to deploy across
+ different hardware. WebNN API targets diverse hardware platforms
+ and defines an operation set whose high-level operations can be
+ decomposed to low level primitives that can map to StableHLO
+ operations. Coordination and alignment between these two
+ operation sets is beneficial to the open ML ecosystem.
+
+
Date: Mon, 9 Jan 2023 11:45:01 +0200
Subject: [PATCH 11/11] Clarify Scope
Move examples of well-known model architectures from the bullet
list into the text section that talks about priority use cases.
Add transformers as another example.
Grammar fix: s/Allow/Allows
Fix #23
---
charter.html | 19 ++++++++++---------
1 file changed, 10 insertions(+), 9 deletions(-)
diff --git a/charter.html b/charter.html
index 11c0958..51de72c 100644
--- a/charter.html
+++ b/charter.html
@@ -208,24 +208,25 @@
This Web API for neural network inference hardware acceleration:
- - Allow to construct a neural network computational graph by common
- building blocks, including constant values and base operations such
- as convolution, pooling, softmax, normalization, fully connected,
- activation, recurrent neural network (RNN) and long short-term memory
- (LSTM);
+
- Allows to construct a neural network computational graph by common
+ building blocks required by well-known model architectures: constant
+ values and base operations such as convolution, pooling, softmax,
+ normalization, fully connected, and activation;
- - Allow to compile the neural network to native optimized format
+
- Allows to compile the neural network to native optimized format
for hardware execution;
- - Allow to setup input from various sources on the Web, e.g. array
+
- Allows to setup input from various sources on the Web, e.g. array
buffers, media streams, schedule the asynchronous hardware execution,
and retrieve the output when hardware execution completes.
This Working Group puts priority on building blocks required by
- well-known model architectures in the fields of Computer Vision,
- Natural Language Processing and Speech Recognition.
+ well-known model architectures such as recurrent neural network
+ (RNN), long short-term memory (LSTM) and transformers in the fields
+ of Computer Vision, Natural Language Processing and Speech
+ Recognition.
The APIs in scope of this group will not be tied to any particular