python - How do you represent na in a Pandas DataFrame? -


does pandas have equivalent of r's na (meaning not available)? if not, convention representing missing value, opposed nan represents mathematically impossible value such divide zero?

currently there no na value available in pandas or numpy. section "working missing data" in pandas manual (http://pandas.pydata.org/pandas-docs/stable/missing_data.html):

the choice of using nan internally denote missing data largely simplicity , performance reasons. differs maskedarray approach of, example, scikits.timeseries. hopeful numpy able provide native na type solution (similar r) performant enough used in pandas.

also, part of documentation (http://pandas.pydata.org/pandas-docs/stable/gotchas.html#nan-integer-na-values-and-na-type-promotions) provides more details on trade-offs in choice of na representation.


Comments

Popular posts from this blog

javascript - How to synchronize the Three.js and HTML/SVG coordinate systems (especially w.r.t. the y-axis)? -

javascript - How do I find how many occurences are there of a highlighted string, and which occurence is it? -

java - Reading data from multiple zip files and combining them to one -