# Test Task Implementation Notes

## Solution progress

Completing the task took much longer than I originally estimated.

It was quite humbling when the first version turned out to be 5-6 times slower than `grep`,
despite heavy speed optimization with statically allocated arrays everywhere and no memory re-allocations.
I expected the result to already beat the analogues. Moreover, I had a specialized algorithm,
while `grep` supports a different syntax with a universal algorithm inside, so I assumed `grep` would be slower by definition.

Originally the line match algorithm used dynamic programming with `O(P*N)` memory.
Then I replaced it with a different one, faster and with constant memory.
And... it became only 2 times slower than `grep`. It was almost a success `:)`
At the same time, according to the profiler, 80+% of the time was spent precisely in the line match algorithm.

I had no doubt that a more efficient algorithm must exist for this task.
The only way to defeat `grep` was to add a few optimizations that simply make the algorithm
scroll forward faster in the most popular scenarios. This change accelerated the matching algorithm by about 6 times,
and the total running time of the program improved by 3 times.
The improved algorithm beat `grep` by only 25-35%.

After that, according to the profiler, almost half of the time was spent
in the line match algorithm, and nearly 40% of the time in the synchronous `ReadFile()` call.

I decided that this was the finest hour of asynchronous file reading!
The operating system could read the next block of the file while the previous one is being parsed and processed.
I implemented it and... nothing. The total running time did not change.
But the profiler showed a redistribution of time towards the line match algorithm.
It was very strange that the line matching algorithm slowed down, and I still don't understand why.
I am convinced that the 40% spent in `ReadFile()` could be compressed to at most 5%
by overlapping the reading of data with its processing.
Perhaps this is somehow related to the fact that the data is in the system file cache,
so it is not really read from the disk (a shorter IRP path).
Perhaps this plain copying of memory in kernel mode simply does not parallelize well.
Maybe it was worth moving the disk reads into a dedicated thread instead...

In the next iteration I tried mapping the file into memory.
This solution does not meet the requirements, because it may throw SEH exceptions on disk read errors,
and I had doubts about how quickly new pages would be faulted in.
The result was slightly worse: the total running time of the program increased by 20%.
That is also strange, given a warmed-up disk cache.
Theoretically, if the data is in the disk cache, it could be mapped
into the process's read-only virtual memory in `O(1)`,
saving both the kernel-mode transitions during the memory scan and the memory copies.

## Testing and Notes

I tested on a web server log: 2 GB, 5.5 million lines,
average line length 380 bytes, no line longer than 1024 bytes.
1600 lines out of 5.5M matched the pattern. I chose the pattern `*string*` as the most common in everyday use.

An SSD drive was used, but I warmed it up first so that all the data landed in the system file cache.

CPU: `Intel Core i5 8th Gen`, laptop edition.

The application built for the `x64` architecture worked faster than the `x86` build.

The `FILE_FLAG_SEQUENTIAL_SCAN` flag gave no performance boost on a warmed cache;
with a cold cache it would have to be measured separately.

Sometimes the application execution time stays elevated by about 25% for a long stretch.
Most likely this is because I am on a laptop and the CPU cores have power-saving modes.

The latest application version takes 1.6 seconds to process the test data, while `grep` takes 2.5 seconds.

**ADDED:**
I also implemented reading the file in a separate thread. The file operation itself is synchronous;
synchronization between the threads is done with a lock-free loop (a spinlock).
It gave a total gain of 25% over both the synchronous and asynchronous API solutions (total running time is 1.2 seconds).
This is 2 times faster than `grep`.

## Implementation features

I kept all four implementations of file reading. You can switch between them in code:

```cpp
#if 0
#if 0
    CSyncLineReader _lineReader;
#else
    CMappingLineReader _lineReader;
#endif
#else
#if 0
    CAsyncLineReader _lineReader;
#else
    CLockFreeLineReader _lineReader;
#endif
#endif
```

The solution contains unit tests in a separate project based on the `gtest` framework.

As required by the challenge, the main console application is built with C++ exceptions disabled and no RTTI.

I have used some parts of the STL at my own risk.
These parts do not use exceptions and work without unnecessary overhead.
I see no reason not to use cheap abstractions that allow writing cleaner and less error-prone code:
`std::unique_ptr`, `std::string_view`, `std::optional`, etc.

---