Commit 9c8547e

MINOR: Make crate READMEs consistent (#2437)
1 parent 6b4bbd0 commit 9c8547e

25 files changed: +131 -33 lines changed

CONTRIBUTING.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -253,5 +253,5 @@ $ prettier --version
253253
After you've confirmed your prettier version, you can format all the `.md` files:
254254

255255
```bash
256-
prettier -w {ballista,datafusion,datafusion-examples,dev,docs,python}/**/*.md
256+
prettier -w {ballista,datafusion,data-access,datafusion-cli,datafusion-examples,dev,docs}/**/*.md
257257
```

ballista/rust/client/Cargo.toml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -22,6 +22,7 @@ license = "Apache-2.0"
2222
version = "0.6.0"
2323
homepage = "https://github.com/apache/arrow-datafusion"
2424
repository = "https://github.com/apache/arrow-datafusion"
25+
readme = "README.md"
2526
authors = ["Apache Arrow <[email protected]>"]
2627
edition = "2021"
2728
rust-version = "1.59"

ballista/rust/core/Cargo.toml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -22,6 +22,7 @@ license = "Apache-2.0"
2222
version = "0.6.0"
2323
homepage = "https://github.com/apache/arrow-datafusion"
2424
repository = "https://github.com/apache/arrow-datafusion"
25+
readme = "README.md"
2526
authors = ["Apache Arrow <[email protected]>"]
2627
edition = "2018"
2728
build = "build.rs"

ballista/rust/core/README.md

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -20,5 +20,4 @@
2020
# Ballista Core Library
2121

2222
This crate contains the Ballista core library which is used as a dependency by the `ballista-client`,
23-
`ballista-scheduler`, and `ballista-executor` crates. Refer to <https://crates.io/crates/ballista> for
24-
general Ballista documentation.
23+
`ballista-scheduler`, and `ballista-executor` crates.

ballista/rust/executor/Cargo.toml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -22,6 +22,7 @@ license = "Apache-2.0"
2222
version = "0.6.0"
2323
homepage = "https://github.com/apache/arrow-datafusion"
2424
repository = "https://github.com/apache/arrow-datafusion"
25+
readme = "README.md"
2526
authors = ["Apache Arrow <[email protected]>"]
2627
edition = "2018"
2728

ballista/rust/executor/README.md

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -19,5 +19,4 @@
1919

2020
# Ballista Executor Process
2121

22-
This crate contains the Ballista executor process. Refer to <https://crates.io/crates/ballista> for
23-
documentation.
22+
This crate contains the Ballista executor process.

ballista/rust/scheduler/Cargo.toml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -22,6 +22,7 @@ license = "Apache-2.0"
2222
version = "0.6.0"
2323
homepage = "https://github.com/apache/arrow-datafusion"
2424
repository = "https://github.com/apache/arrow-datafusion"
25+
readme = "README.md"
2526
authors = ["Apache Arrow <[email protected]>"]
2627
edition = "2018"
2728

ballista/rust/scheduler/README.md

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -19,5 +19,4 @@
1919

2020
# Ballista Scheduler Process
2121

22-
This crate contains the Ballista scheduler process. Refer to <https://crates.io/crates/ballista> for
23-
documentation.
22+
This crate contains the Ballista scheduler process.

data-access/README.md

Lines changed: 8 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -17,6 +17,12 @@
1717
under the License.
1818
-->
1919

20-
# Data Access Layer
20+
# DataFusion Data Access Layer
2121

22-
This module contains an `async` API for accessing data, either remotely or locally. Currently, it's based on the object store interfaces. In the future, this module may include interfaces for accessing databases, or streaming data.
22+
[DataFusion](df) is an extensible query execution framework, written in Rust, that uses Apache Arrow as its in-memory format.
23+
24+
This crate is a submodule of DataFusion that provides an `async` API for accessing data, either remotely or locally.
25+
Currently, it is based on the object store interfaces. In the future, this module may include interfaces for accessing
26+
databases, or streaming data.
27+
28+
[df]: https://crates.io/crates/datafusion

datafusion-cli/Cargo.toml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -26,6 +26,7 @@ license = "Apache-2.0"
2626
homepage = "https://github.com/apache/arrow-datafusion"
2727
repository = "https://github.com/apache/arrow-datafusion"
2828
rust-version = "1.59"
29+
readme = "README.md"
2930

3031
[dependencies]
3132
arrow = { version = "12" }

datafusion-cli/README.md

Lines changed: 6 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -19,6 +19,8 @@
1919

2020
# DataFusion Command-line Interface
2121

22+
[DataFusion](df) is an extensible query execution framework, written in Rust, that uses Apache Arrow as its in-memory format.
23+
2224
The DataFusion CLI allows SQL queries to be executed by an in-process DataFusion context, or by a distributed
2325
Ballista context.
2426

@@ -75,6 +77,7 @@ cargo build
7577
```
7678

7779
## Ballista
80+
7881
If you want to execute the SQL in ballista by `datafusion-cli`, you must build/compile the `datafusion-cli` with features of "ballista" first.
7982

8083
```bash
@@ -86,4 +89,6 @@ The DataFusion CLI can connect to a Ballista scheduler for query execution.
8689

8790
```bash
8891
datafusion-cli --host localhost --port 50050
89-
```
92+
```
93+
94+
[df]: https://crates.io/crates/datafusion

datafusion/common/README.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -19,6 +19,8 @@
1919

2020
# DataFusion Common
2121

22-
This is an internal module for the most fundamental types of [DataFusion][df].
22+
[DataFusion](df) is an extensible query execution framework, written in Rust, that uses Apache Arrow as its in-memory format.
23+
24+
This crate is a submodule of DataFusion that provides common data types and utilities.
2325

2426
[df]: https://crates.io/crates/datafusion

datafusion/expr/Cargo.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,7 @@ description = "Logical plan and expression representation for DataFusion query e
2121
version = "7.0.0"
2222
homepage = "https://github.com/apache/arrow-datafusion"
2323
repository = "https://github.com/apache/arrow-datafusion"
24-
readme = "../README.md"
24+
readme = "README.md"
2525
authors = ["Apache Arrow <[email protected]>"]
2626
license = "Apache-2.0"
2727
keywords = [ "datafusion", "logical", "plan", "expressions" ]

datafusion/expr/README.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -19,6 +19,8 @@
1919

2020
# DataFusion Logical Plan and Expressions
2121

22-
This is an internal module for fundamental expression types of [DataFusion][df].
22+
[DataFusion](df) is an extensible query execution framework, written in Rust, that uses Apache Arrow as its in-memory format.
23+
24+
This crate is a submodule of DataFusion that provides data types and utilities for logical plans and expressions.
2325

2426
[df]: https://crates.io/crates/datafusion

datafusion/expr/src/aggregate_function.rs

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -682,7 +682,7 @@ pub fn is_correlation_support_arg_type(arg_type: &DataType) -> bool {
682682
}
683683

684684
/// Return `true` if `arg_type` is of a [`DataType`] that the
685-
/// [`ApproxPercentileCont`] aggregation can operate on.
685+
/// [`AggregateFunction::ApproxPercentileCont`] aggregation can operate on.
686686
pub fn is_approx_percentile_cont_supported_arg_type(arg_type: &DataType) -> bool {
687687
matches!(
688688
arg_type,

datafusion/expr/src/lib.rs

Lines changed: 11 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -15,6 +15,16 @@
1515
// specific language governing permissions and limitations
1616
// under the License.
1717

18+
//! [DataFusion](https://github.com/apache/arrow-datafusion)
19+
//! is an extensible query execution framework that uses
20+
//! [Apache Arrow](https://arrow.apache.org) as its in-memory format.
21+
//!
22+
//! This crate is a submodule of DataFusion that provides types representing
23+
//! logical query plans ([LogicalPlan]) and logical expressions ([Expr]) as well as utilities for
24+
//! working with these types.
25+
//!
26+
//! The [expr_fn] module contains functions for creating expressions.
27+
1828
mod accumulator;
1929
pub mod aggregate_function;
2030
pub mod array_expressions;
@@ -44,18 +54,7 @@ pub use aggregate_function::AggregateFunction;
4454
pub use built_in_function::BuiltinScalarFunction;
4555
pub use columnar_value::{ColumnarValue, NullColumnarValue};
4656
pub use expr::Expr;
47-
pub use expr_fn::{
48-
abs, acos, and, approx_distinct, approx_percentile_cont, array, ascii, asin, atan,
49-
avg, bit_length, btrim, case, ceil, character_length, chr, coalesce, col, concat,
50-
concat_expr, concat_ws, concat_ws_expr, cos, count, count_distinct, date_part,
51-
date_trunc, digest, exists, exp, floor, in_list, in_subquery, initcap, left, length,
52-
ln, log10, log2, lower, lpad, ltrim, max, md5, min, not_exists, not_in_subquery, now,
53-
now_expr, nullif, octet_length, or, random, regexp_match, regexp_replace, repeat,
54-
replace, reverse, right, round, rpad, rtrim, scalar_subquery, sha224, sha256, sha384,
55-
sha512, signum, sin, split_part, sqrt, starts_with, strpos, substr, sum, tan, to_hex,
56-
to_timestamp_micros, to_timestamp_millis, to_timestamp_seconds, translate, trim,
57-
trunc, upper, when,
58-
};
57+
pub use expr_fn::*;
5958
pub use expr_schema::ExprSchemable;
6059
pub use function::{
6160
AccumulatorFunctionImplementation, ReturnTypeFunction, ScalarFunctionImplementation,
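
The switch from an explicit `pub use expr_fn::{...}` list to `pub use expr_fn::*;` can be illustrated with a minimal sketch. The module below is a hypothetical stand-in, not the crate's actual `expr_fn`: the names `col` and `lower` are borrowed from the removed list, but their signatures and string return values are simplified assumptions made only so the example is self-contained.

```rust
// Hypothetical stand-in for an `expr_fn`-style module. The real functions
// build expression values; these simplified versions return strings.
mod expr_fn {
    pub fn col(name: &str) -> String {
        format!("col({name})")
    }

    pub fn lower(arg: &str) -> String {
        format!("lower({arg})")
    }

    // Not `pub`, so the glob re-export below does not expose it.
    #[allow(dead_code)]
    fn internal_helper() {}
}

// An explicit list would have to be extended by hand for every new function:
// pub use expr_fn::{col, lower};

// A glob re-export picks up every `pub` item in the module automatically.
pub use expr_fn::*;

fn main() {
    // Both functions are reachable at the crate root without the `expr_fn::` prefix.
    println!("{}", lower(&col("name")));
}
```

The trade-off in this sketch is that the glob form needs no maintenance when a function is added, while the explicit form makes the public surface visible at a glance.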

datafusion/jit/Cargo.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,7 @@ description = "Just In Time (JIT) compilation support for DataFusion query engin
2121
version = "7.0.0"
2222
homepage = "https://github.com/apache/arrow-datafusion"
2323
repository = "https://github.com/apache/arrow-datafusion"
24-
readme = "../README.md"
24+
readme = "README.md"
2525
authors = ["Apache Arrow <[email protected]>"]
2626
license = "Apache-2.0"
2727
keywords = [ "arrow", "query", "sql" ]

datafusion/jit/README.md

Lines changed: 26 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,26 @@
1+
<!---
2+
Licensed to the Apache Software Foundation (ASF) under one
3+
or more contributor license agreements. See the NOTICE file
4+
distributed with this work for additional information
5+
regarding copyright ownership. The ASF licenses this file
6+
to you under the Apache License, Version 2.0 (the
7+
"License"); you may not use this file except in compliance
8+
with the License. You may obtain a copy of the License at
9+
10+
http://www.apache.org/licenses/LICENSE-2.0
11+
12+
Unless required by applicable law or agreed to in writing,
13+
software distributed under the License is distributed on an
14+
"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
15+
KIND, either express or implied. See the License for the
16+
specific language governing permissions and limitations
17+
under the License.
18+
-->
19+
20+
# DataFusion JIT
21+
22+
[DataFusion](df) is an extensible query execution framework, written in Rust, that uses Apache Arrow as its in-memory format.
23+
24+
This crate is a submodule of DataFusion that provides JIT code generation.
25+
26+
[df]: https://crates.io/crates/datafusion

datafusion/physical-expr/Cargo.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,7 @@ description = "Physical expression implementation for DataFusion query engine"
2121
version = "7.0.0"
2222
homepage = "https://github.com/apache/arrow-datafusion"
2323
repository = "https://github.com/apache/arrow-datafusion"
24-
readme = "../README.md"
24+
readme = "README.md"
2525
authors = ["Apache Arrow <[email protected]>"]
2626
license = "Apache-2.0"
2727
keywords = [ "arrow", "query", "sql" ]

datafusion/physical-expr/README.md

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -17,8 +17,10 @@
1717
under the License.
1818
-->
1919

20-
# DataFusion Physical Expr
20+
# DataFusion Physical Expressions
2121

22-
This is an internal module for fundamental physical expression types of [DataFusion][df].
22+
[DataFusion](df) is an extensible query execution framework, written in Rust, that uses Apache Arrow as its in-memory format.
23+
24+
This crate is a submodule of DataFusion that provides data types and utilities for physical expressions.
2325

2426
[df]: https://crates.io/crates/datafusion

datafusion/proto/Cargo.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,7 @@ description = "Protobuf serialization of DataFusion logical plan expressions"
2121
version = "7.0.0"
2222
homepage = "https://github.com/apache/arrow-datafusion"
2323
repository = "https://github.com/apache/arrow-datafusion"
24-
readme = "../README.md"
24+
readme = "README.md"
2525
authors = ["Apache Arrow <[email protected]>"]
2626
license = "Apache-2.0"
2727
keywords = [ "arrow", "query", "sql" ]

datafusion/proto/README.md

Lines changed: 26 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,26 @@
1+
<!---
2+
Licensed to the Apache Software Foundation (ASF) under one
3+
or more contributor license agreements. See the NOTICE file
4+
distributed with this work for additional information
5+
regarding copyright ownership. The ASF licenses this file
6+
to you under the Apache License, Version 2.0 (the
7+
"License"); you may not use this file except in compliance
8+
with the License. You may obtain a copy of the License at
9+
10+
http://www.apache.org/licenses/LICENSE-2.0
11+
12+
Unless required by applicable law or agreed to in writing,
13+
software distributed under the License is distributed on an
14+
"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
15+
KIND, either express or implied. See the License for the
16+
specific language governing permissions and limitations
17+
under the License.
18+
-->
19+
20+
# DataFusion Proto
21+
22+
[DataFusion](df) is an extensible query execution framework, written in Rust, that uses Apache Arrow as its in-memory format.
23+
24+
This crate is a submodule of DataFusion that provides a protocol buffer format for representing query plans and expressions.
25+
26+
[df]: https://crates.io/crates/datafusion

datafusion/row/Cargo.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,7 @@ description = "Row backed by raw bytes for DataFusion query engine"
2121
version = "7.0.0"
2222
homepage = "https://github.com/apache/arrow-datafusion"
2323
repository = "https://github.com/apache/arrow-datafusion"
24-
readme = "../README.md"
24+
readme = "README.md"
2525
authors = ["Apache Arrow <[email protected]>"]
2626
license = "Apache-2.0"
2727
keywords = [ "arrow", "query", "sql" ]

datafusion/row/README.md

Lines changed: 26 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,26 @@
1+
<!---
2+
Licensed to the Apache Software Foundation (ASF) under one
3+
or more contributor license agreements. See the NOTICE file
4+
distributed with this work for additional information
5+
regarding copyright ownership. The ASF licenses this file
6+
to you under the Apache License, Version 2.0 (the
7+
"License"); you may not use this file except in compliance
8+
with the License. You may obtain a copy of the License at
9+
10+
http://www.apache.org/licenses/LICENSE-2.0
11+
12+
Unless required by applicable law or agreed to in writing,
13+
software distributed under the License is distributed on an
14+
"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
15+
KIND, either express or implied. See the License for the
16+
specific language governing permissions and limitations
17+
under the License.
18+
-->
19+
20+
# DataFusion Row
21+
22+
[DataFusion](df) is an extensible query execution framework, written in Rust, that uses Apache Arrow as its in-memory format.
23+
24+
This crate is a submodule of DataFusion that provides a format for row-based data.
25+
26+
[df]: https://crates.io/crates/datafusion

dev/release/README.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -34,14 +34,16 @@ Python binding or Ballista always requires a new DataFusion version release.
3434

3535
### Major Release
3636

37-
DataFusion typically has major releases from the `master` branch every 3 months, including breaking API changes.
37+
DataFusion typically has major releases from the `master` branch every 3 months, including breaking API changes.
3838

3939
### Minor Release
4040

4141
Starting v7.0.0, we are experimenting with maintaining an active stable release branch (e.g. `maint-7.x`). Every month, we will review the `maint-*` branch and prepare a minor release (e.g. v7.1.0) when necessary. A patch release (v7.0.1) can be requested on demand if it is urgent bug/security fix.
4242

4343
#### How to add changes to `maint-*` branch?
44+
4445
If you would like to propose your change for inclusion in the maintenance branch
46+
4547
1. follow normal workflow to create PR to `master` branch and wait for its approval and merges.
4648
2. after PR is squash merged to `master`, branch from most recent maintenance branch (e.g. `maint-7-x`), cherry-pick the commit and create a PR to maintenance branch (e.g. `maint-7-x`).
4749
