datamash

Calculates basic statistical measures on textual input files.

Made by Assaf Gordon

  • data-analysis

  • Statistical Analyzer

  • text-analysis

What is datamash?

GNU datamash is a command-line program that performs simple calculations such as counting, summing, finding minimum and maximum values, calculating means and standard deviations, and coalescing strings on input files. With a rich set of statistical functions, datamash enables quick assessment of information within textual input files. As an example, a user can calculate basic statistics including mean, first quartile, median, third quartile, interquartile range, sample standard deviation, and the p-value of the Jarque-Bera test for normal distribution, all through a single, concise command

Highlights

  • Performs a wide range of statistical calculations on input data, including count, sum, min, max, mean, standard deviation, and more
  • Supports coalescing of string data, enabling analysis of textual input files
  • Provides a compact, efficient command-line interface for quick data analysis tasks

Platforms

  • Windows
  • Linux
  • Mac
  • BSD

Languages

  • English

Features

    • Command line interface