Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

cat aa ab ac 与python运行脚本生成的dataset.pkl不相同?大小差了两倍左右?! #62

Open
Ming-Qin-tech opened this issue Nov 8, 2019 · 4 comments

Comments

@Ming-Qin-tech
Copy link

python build_dataset.py

We put a processed data 'dataset.pkl' in DeepInterestNetwork/din. Considering the GitHub's file size limit of 100.00 MB, we split it into 3 file aa ab ac.

cat aa ab ac > dataset.pkl
按道理讲,两个dataset不应该是一样的吗?cat aa ab ac 不是仅仅是为了读者方便吗?

@ghost
Copy link

ghost commented Dec 27, 2019

cat aa ab ac > dataset.pkl 生成的文件为223,497KB,bulid_dataset.py生成的文件为84,779KB,你是这样吗?

@Ming-Qin-tech
Copy link
Author

太久了,不记得了,不好意思,

@YoungsonZhao
Copy link

cat aa ab ac > dataset.pkl 生成的文件为223,497KB,bulid_dataset.py生成的文件为84,779KB,你是这样吗?

我这是这样的,不知道为何

@ucasiggcas
Copy link

cat aa ab ac > dataset.pkl 生成的文件为223,497KB,bulid_dataset.py生成的文件为84,779KB,你是这样吗?

我的和你的差不多。难道原来的数据集不一样??或者pickle版本升级了?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants