If you find this useful, please consider supporting my work with a donation.
A collection of tools for processing CSV files using streams. This package provides functions to count data rows and split CSV data into smaller chunks while preserving headers.
npm install @humanwhocodes/csv-toolsThis package exports two main functions for working with CSV data via ReadableStream objects:
Counts rows in a CSV file with configurable options.
import { countRows } from "@humanwhocodes/csv-tools";
// From a file in Node.js
import { createReadStream } from "node:fs";
import { ReadableStream } from "node:stream/web";
const fileStream = createReadStream("data.csv");
const webStream = ReadableStream.from(fileStream);
// Count only data rows (exclude header)
const dataRowCount = await countRows(webStream);
console.log(`Found ${dataRowCount} data rows`);
// Count all rows including header
const fileStream2 = createReadStream("data.csv");
const webStream2 = ReadableStream.from(fileStream2);
const totalRowCount = await countRows(webStream2, { countHeaderRow: true });
console.log(`Found ${totalRowCount} total rows`);Parameters:
stream(ReadableStream<Uint8Array>) - A readable stream containing CSV dataoptions(Object, optional) - Configuration optionscountHeaderRow(boolean, default:false) - Whether to count the header rowcountEmptyRows(boolean, default:false) - Whether to count empty rows
Returns: Promise<number> - The count of rows in the CSV file
An async generator function that yields strings of mini CSV files. Each chunk contains the header row followed by up to chunkSize data rows.
import { chunk } from "@humanwhocodes/csv-tools";
// From a file in Node.js
import { createReadStream } from "node:fs";
import { ReadableStream } from "node:stream/web";
const fileStream = createReadStream("data.csv");
const webStream = ReadableStream.from(fileStream);
// Process CSV in chunks of 50 rows
for await (const csvChunk of chunk(webStream, { chunkSize: 50 })) {
// Each csvChunk is a string with header + up to 50 data rows
console.log("Processing chunk:");
console.log(csvChunk);
// Process the chunk...
}Parameters:
stream(ReadableStream<Uint8Array>) - A readable stream containing CSV dataoptions(Object) - Configuration optionschunkSize(number, optional) - Number of data rows per chunk. Default: 100includeEmptyRows(boolean, optional) - Whether to include empty rows. Default: false
Returns: AsyncGenerator<string> - An async generator yielding CSV chunks as strings
import { countRows, chunk } from "@humanwhocodes/csv-tools";
// Fetch CSV from URL
const response = await fetch("https://example.com/data.csv");
const reader = response.body.getReader();
const stream = new ReadableStream({
start(controller) {
return pump();
function pump() {
return reader.read().then(({ done, value }) => {
if (done) {
controller.close();
return;
}
controller.enqueue(value);
return pump();
});
}
},
});
// Count rows
const rowCount = await countRows(stream);
console.log(`Total rows: ${rowCount}`);
// Or process in chunks
const response2 = await fetch("https://example.com/data.csv");
const reader2 = response2.body.getReader();
const stream2 = new ReadableStream({
start(controller) {
return pump();
function pump() {
return reader2.read().then(({ done, value }) => {
if (done) {
controller.close();
return;
}
controller.enqueue(value);
return pump();
});
}
},
});
for await (const csvChunk of chunk(stream2, { chunkSize: 100 })) {
// Process each chunk
await processData(csvChunk);
}- Stream-based processing - Memory efficient handling of large CSV files
- Preserves headers - Each chunk includes the CSV header row
- Handles edge cases - Properly skips empty lines and ignores trailing newlines
- TypeScript support - Full type definitions included
- Cross-platform - Works in Node.js, Deno, and browsers
Copyright 2025 Nicholas C. Zakas
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.