Plain-text CSV for longevity

Plain-text CSV for longevity

BusDK treats business and accounting data as durable records that must remain accessible decades into the future. The core design constraint is that the canonical dataset should stay readable with common, general-purpose tooling and should not depend on proprietary application storage or vendor-controlled file formats.

The preferred default representation is UTF-8 CSV paired with explicit schemas. CSV is plain text, ubiquitous across operating systems and languages, and straightforward to inspect, diff, and transform; it fits a longevity-oriented approach where a repository should remain intelligible even if BusDK itself is not available.

CSV is a delivery convention rather than the goal. BusDK should be able to adopt other storage or serialization approaches over time as long as the system preserves long-term accessibility — in the sense of open documentation, broad tool support, and predictable export back to simple, tabular text formats — consistent with National Archives guidance on selecting sustainable formats for electronic records.