Baler
Log Pattern Extraction
System events can produce millions of log lines. The set of possible lines is unknown in advance: some lines appear only during rare but significant events, and others change when system software is updated.
Typical log data tools rely on regular expression (or grep) matching, which requires knowing the log lines in advance in order to write the regular expressions. Baler extracts patterns deterministically from the log lines themselves, with no advance information required. The result is a vastly smaller number of patterns to parse than raw lines, and the patterns can be compared across systems.
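To make the idea of a pattern concrete, here is a minimal sketch in Python. It is not Baler's implementation or its API. It assumes, purely for illustration, that any whitespace-delimited token containing a digit is variable data and can be replaced by a wildcard. The function names `to_pattern` and `extract_patterns` and the sample log lines are hypothetical.

```python
from collections import Counter


def to_pattern(line: str) -> str:
    """Reduce one log line to a pattern.

    Illustrative assumption (not Baler's actual rule): a token that
    contains any digit is treated as variable data and becomes '*'.
    """
    tokens = line.split()
    return " ".join(
        "*" if any(ch.isdigit() for ch in tok) else tok
        for tok in tokens
    )


def extract_patterns(lines):
    """Count how many input lines fall under each pattern."""
    return Counter(to_pattern(line) for line in lines)


if __name__ == "__main__":
    # Hypothetical sample lines, invented for this example.
    sample = [
        "nid00012 kernel: Out of memory: Killed process 4321",
        "nid00345 kernel: Out of memory: Killed process 99",
        "nid00012 sshd: Accepted publickey for user from 10.0.0.5",
    ]
    for pattern, count in extract_patterns(sample).most_common():
        print(count, pattern)
    # 2 * kernel: Out of memory: Killed process *
    # 1 * sshd: Accepted publickey for user from *
```

The same input always yields the same patterns, and no regular expression had to be written for any specific message. Three lines reduce to two patterns here; on real logs the reduction is what makes the output reviewable and comparable across systems.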
For more information
- Baler: Deterministic, lossless log message clustering tool. N. Taerat, J. Brandt, A. Gentile, M. Wong, and C. Leangsuksun. In: Computer Science - Research and Development, Volume 26, Numbers 3-4, 285-295, DOI: 10.1007/s00450-011-0155-3. Int’l. Supercomputing Conference (ISC), June 2011.
- New Systems, New Behaviors, New Patterns: Monitoring Insights from System Standup. J. Brandt, A. Gentile, C. Martin, J. Repik, and N. Taerat. Workshop on Monitoring and Analysis for High Performance Computing Systems Plus Applications (HPCMASPA) at IEEE Int’l. Conf. on Cluster Computing (CLUSTER), Sept 2015.