Mixed data types
Numbers, text, blanks, and symbols mixed in one column can confuse summaries and downstream tools.
A useful CSV needs columns that are understandable, consistently formatted, and clear enough for the next person or tool to interpret.
MLdeck Data Quality reviews schema signals in the browser so confusing columns can be inspected before analysis or AutoML.
Schema clarity is about whether each column has a clear role and format. A column might look numeric but contain text labels. Dates may use several formats. IDs can look like ordinary categories. Text fields may contain values that should have been split into separate columns.
Those issues do not always make a file unusable. They mean the file deserves review before decisions depend on it.
Numbers, text, blanks, and symbols mixed in one column can confuse summaries and downstream tools.
Different date formats can make ordering, filtering, and time-based interpretation unreliable.
IDs and keys may be useful for joining tables, but they often need review before analysis or modeling.
Column names like value, status, type, or flag may need context before they are safe to interpret.
MLdeck Data Quality profiles column types, summarizes numeric and categorical signals, and highlights review areas such as high-cardinality fields, constant columns, and suspicious schema patterns. The goal is guidance, not automatic correction.
Use the output to decide whether a column needs documentation, cleanup, exclusion, or a second look from someone who understands the data source.
AutoML can train models, but it cannot replace human understanding of what each column means. Schema review helps catch confusing inputs before model metrics are interpreted.