pandas.read_csv from a string or package data

联想a365 • 2022-12-14 • Notes • 8 views

The following works for me in Python 3.3:

>>> import numpy as np, pandas as pd
>>> import io, pkgutil
>>> wells = pkgutil.get_data('pymc.examples', 'data/wells.dat')
>>> type(wells)
<class 'bytes'>
>>> df = pd.read_csv(io.BytesIO(wells), encoding='utf8', sep=" ", index_col="id", dtype={"switch": np.int8})
>>> df.head()
    switch  arsenic       dist  assoc  educ
id
1        1     2.36  16.826000      0     0
2        1     0.71  47.321999      0     0
3        0     2.07  20.966999      0    10
4        1     1.15  21.486000      0    12
5        1     1.10  40.874001      1    14

[5 rows x 5 columns]

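Note: I had to place wells.dat in that location by hand, so I can't guarantee that I copied it correctly or that it is free of trailing whitespace, since I removed some whitespace myself. But passing read_csv a BytesIO object together with the encoding parameter should work. (In fact, you could probably get away without the encoding, but specifying it is a good habit. io.TextIOWrapper might be another alternative.)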
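The options mentioned in the note can be compared side by side. The following is a minimal sketch, not the original author's code: the `raw` bytes are a small hypothetical stand-in for what `pkgutil.get_data` would return (the real wells.dat is not reproduced here), and the variable names are made up for illustration. It reads the same data through `io.BytesIO`, through `io.TextIOWrapper`, and, for the case where the data is already a string, through `io.StringIO`.

```python
import io

import pandas as pd

# Hypothetical stand-in for the bytes returned by pkgutil.get_data();
# the values only imitate the layout of wells.dat.
raw = (
    b"id switch arsenic dist assoc educ\n"
    b"1 1 2.36 16.826 0 0\n"
    b"2 1 0.71 47.322 0 0\n"
    b"3 0 2.07 20.967 0 10\n"
)

# Option 1: hand pandas a binary stream and let it decode via `encoding`.
df_bytes = pd.read_csv(io.BytesIO(raw), encoding="utf8", sep=" ", index_col="id")

# Option 2: wrap the binary stream so pandas receives already-decoded text.
text_stream = io.TextIOWrapper(io.BytesIO(raw), encoding="utf8")
df_wrapped = pd.read_csv(text_stream, sep=" ", index_col="id")

# Option 3: if the data is already a str, StringIO is enough.
df_text = pd.read_csv(io.StringIO(raw.decode("utf8")), sep=" ", index_col="id")

print(df_bytes.equals(df_wrapped) and df_bytes.equals(df_text))
```

All three calls should yield the same DataFrame; which one is most convenient mostly depends on whether the data arrives as bytes (as with package data) or as a string.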

Sharing is welcome; when reposting, please credit the source: 内存溢出

Original URL: https://www.54852.com/zaji/5587662.html
