-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathNEWS
223 lines (145 loc) · 6.08 KB
/
NEWS
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
* Noteworthy changes in release ?.? (?-?-?) []
** Bug Fixes
Datamash now passes the -z/--zero-terminated flag to the sort(1) child
process when used with "--sort --zero-terminated". Additionally,
if the system's sort(1) does not support -z, datamash reports the error
and exits. Previously it would omit the "-z" when running sort(1),
resulting in incorrect results.
Documentation fixes and spelling corrections.
Incorrect format in a decorate(1) error breaking compilation on some
systems.
* Noteworthy changes in release 1.7 (2020-04-23) [testing]
** New Features
decorate(1): new program - sorts input in non-standard ordering, e.g.
IPv4, IPv6, roman numerals.
New operations: sha224/sha384.
New operations: geomean (Geometric mean) and harmmean (Harmonic mean).
* Noteworthy changes in release 1.6 (2020-02-24) [stable]
** Bug Fixes
The 'gutnum' operation (introduced in vresion 1.5) now correctly
prints detected numbers without truncating them.
* Noteworthy changes in release 1.5 (2019-09-17) [stable]
** New Features
Datamash now accepts backslash-escaped characters in field names.
This allows working with named fields containing dash/mins,colons,commas
or field names starting with digits (Note the interplay between
backslash and shell quoting). The following are equivalent,
and sum an input field named 'FOO-BAR':
datamash -H sum FOO\\-BAR < input.txt
datamash -H sum 'FOO\-BAR' < input.txt
datamash -H sum "FOO\\-BAR" < input.txt
New operations: dirname, basename
These behave just like dirname(1) and basename(1):
$ echo /home/foo/bar.txt | datamash dirname 1 basename 1
/home/foo bar.txt
New operations: extname, barename
'extname' extract the extension of the file name.
'barename' (not to be confused with 'basename') extract the basename
without the extension.
Example:
$ echo /home/foo/bar.tar.gz | datamash barename 1 extame 1
bar tar.gz
New operation: getnum
This operation extracts a number from a string.
'getnum' accepts an optional single letter option:
getnum:n - natural numbers (positive integers, including zero)
getnum:i - integers
getnum:d - decimal point numbers
getnum:p - positive decimal point numbers (this is the default)
getnum:h - hex numbers
getnum:o - octal numbers
Examples:
$ echo foo-42.0-bar | datamash getnum 1
42.0
$ echo foo-42.0-bar | datamash getnum:n 1
42
$ echo foo-42.0-bar | datamash getnum:i 1
-42
$ echo foo-42.0-bar | datamash getnum:d 1
-42.0
New operation: cut
Similar to cut(1), it copies the input field to the output as-is.
The advantage over cut(1) is that combined with datamash's other features,
input fields can be specified by name instead of column number, and
output fields can be re-ordered and duplicated.
Example:
$ printf "a b c\n1 X 6\n" | datamash -W -H cut c,a,c
cut(c) cut(a) cut(c)
6 1 6
** Bug fixes
Datamash now correctly calculates mode/antimode for negative values.
In version 1.4 and earlier, the following produced incorrect results:
$ echo -1 | datamash-1.4 mode 1
1.844674407371e+19
* Noteworthy changes in release 1.4 (2018-12-22) [stable]
** New Features
New option: -C/--skip-comments to skip comment lines (lines starting
with '#' or ';' and optional whitespace).
* Noteworthy changes in release 1.3 (2018-03-16) [stable]
** New Features
New option: --format=FMT sets printf style floating-point format.
Example:
$ echo '50.5' | datamash --format "%07.3f" sum 1
050.500
$ echo '50.5' | datamash --format "%07.3e" sum 1
5.050e+01
New option: -R/--round=N rounds numeric values to N decimal places.
New option: --output-delimiter=X overrides -t/-W.
New operation: trimmean (trimmed mean value).
To calculate 20% trimmed mean:
$ printf "%s\n" 13 3 7 33 3 9 | datamash trimmean:0.2 1
8
** Bug fixes
Datamash now builds correctly with external OpenSSL libraries
(./configure --with-openssl=yes). The 'configure' script now reports
whether internal or external libraries are used:
$ ./configure [OPTIONS]
[...]
Configuration summary for datamash
md5/sha*: internal (gnulib)
OR
md5/sha*: external (-lcrypto)
* Noteworthy changes in release 1.2 (2017-08-22) [stable]
** New Features
New operations:
perc (percentile),
range (max-min of values in group/column)
Improved 'check' operation:
Expected number of lines/fields can be specified as parameter.
** Improvements
Improved bash-completion script installation path (see README for details).
* Noteworthy changes in release 1.1.1 (2017-01-19) [stable]
** Bug fixes
'check' command correctly counts a trailing delimiter at end of lines.
'transpose' command correctly handles missing fields on the last line.
* Noteworthy changes in release 1.1.0 (2016-01-16) [stable]
** New Features
Bumped version to 1.1.0 to better comply to semver.
New operations:
crosstab (cross-tabulation / pivot-tables),
check (verify tabular structure),
bin (bin numeric values)
strbin (bin strings values)
pearson correlation,
covariance,
rounding functions: round,floor,ceil,trunc,frac
** Improvements
Speed, Portability, Tests, Coverage improvements.
* Noteworthy changes in release 1.0.7 (2015-06-29) [stable]
** New Features
New operations: md5, sha1/256/512, base64, rmdup.
New option --narm to ignore NaN/NA values.
New feature: ability to specify field by names instead of numbers
(require using --header-in or -H).
New translations added.
** Improvements
Speed, Portability, Coverage improvements.
* Noteworthy changes in release 1.0.6 (2014-07-29) [stable]
** New Features
New operations: transpose, reverse.
** Improvements
Tests: improve portability, add I/O error tests, add few edge-case tests.
Build: improve man-page generation, cross-compiling, auxiliary build scripts.
Documentation: expand and fix man-page (and shorten --help screen).
* Noteworthy changes in release 1.0.5 (2014-07-15) [stable]
First release as GNU Datamash.