PROCFILE
procfile is a library for reading text data files (i.e /proc
files) in the fastest possible way.
#
How it worksThe library automatically adapts (through the iterations) its memory so that each file
is read with single read()
call.
Then the library splits the file into words, using the supplied separators. The library also supported quoted words (i.e. strings within of which the separators are ignored).
#
InitializationInitially the caller:
- calls
procfile_open()
to open the file and allocate the structures needed.
#
IterationsFor each iteration, the caller:
calls
procfile_readall()
to read updated contents. This call also rewinds (lseek()
to 0) before reading it.For every file, a BUFFER is used that is automatically adjusted to fit the entire file contents of the file. So the file is read with a single
read()
call (providing atomicity / consistency when the data are read from the kernel).Once the data are read, 2 arrays of pointers are updated:
a
words
array, pointing to each word in the data reada
lines
array, pointing to the first word for each lineThis is highly optimized. Both arrays are automatically adjusted to fit all contents and are updated in a single pass on the data.
The library provides a number of macros:
procfile_lines()
returns the # of lines readprocfile_linewords()
returns the # of words in the given lineprocfile_word()
returns a pointer the given word #procfile_line()
returns a pointer to the first word of the given line #procfile_lineword()
returns a pointer to the given word # of the given line #
#
CleanupWhen the caller exits:
- calls
procfile_free()
to close the file and free all memory used.
#
Performance- a raspberry Pi 1 (the oldest single core one) can process 5.000+
/proc
files per second. - a J1900 Celeron processor can process 23.000+
/proc
files per second per core.
To achieve this kind of performance, the library tries to work in batches so that the code and the data are inside the processor's caches.
This library is extensively used in Netdata and its plugins.