python:mnist 数据集下载,parse

pip install mnist

pip show mnist

Name: mnist

Version: 0.2.2

Summary: Python utilities to download and parse the MNIST dataset

Home-page: https://github.com/datapythonista/mnist

Author: Marc Garcia

Author-email: garcia.marc@gmail.com

License: BSD

Location: d:\python39\lib\site-packages

Requires: numpy

https://opendatalab.com/MNIST/

复制代码
D:\python>TREE /F data

D:\PYTHON\DATA
└─MNIST
    └─raw
            t10k-images-idx3-ubyte
            t10k-images-idx3-ubyte.gz
            t10k-labels-idx1-ubyte
            t10k-labels-idx1-ubyte.gz
            train-images-idx3-ubyte
            train-images-idx3-ubyte.gz
            train-labels-idx1-ubyte
            train-labels-idx1-ubyte.gz

修改 D:\Python39\Lib\site-packages\mnist\init.py

找到第143行:fname = download_file(fname, target_dir=target_dir, force=force) 注释掉,修改为:

python 复制代码
def download_and_parse_mnist_file(fname, target_dir=None, force=False):
    """Download the IDX file named fname from the URL specified in dataset_url
    and return it as a numpy array.

    Parameters
    ----------
    fname : str
        File name to download and parse

    target_dir : str
        Directory where to store the file

    force : bool
        Force downloading the file, if it already exists

    Returns
    -------
    data : numpy.ndarray
        Numpy array with the dimensions and the data in the IDX file
    """
    #fname = download_file(fname, target_dir=target_dir, force=force)
    if target_dir == None:
        target_dir = './data/MNIST/raw'
    fname = os.path.join(target_dir, fname)
        
    fopen = gzip.open if os.path.splitext(fname)[1] == '.gz' else open
    with fopen(fname, 'rb') as fd:
        return parse_idx(fd)

编写一个测试脚本 mnist_test.py 如下

python 复制代码
import mnist

print('mnist train shape:')
train_X = mnist.train_images()
train_y = mnist.train_labels()
print(train_X.shape, train_y.shape)

print('mnist test shape:')
test_X = mnist.test_images()
test_y = mnist.test_labels()
print(test_X.shape, test_y.shape)

运行 cmd

python mnist_test.py

python 复制代码
D:\python> python mnist_test.py
mnist train shape:
(60000, 28, 28) (60000,)
mnist test shape:
(10000, 28, 28) (10000,)

编写一个显示手写数字图像的脚本 mnist_image1.py 如下

python 复制代码
from PIL import Image
import numpy as np
import mnist

images = mnist.train_images()
img = Image.fromarray(images[0]).resize((280, 280), Image.Resampling.NEAREST)
#img.save('mnist_img1.jpg') # 可选:保存图片
img.show() # 可选:显示图片 

python mnist_image1.py

相关推荐
c_lb728810 分钟前
期货量化策略从 Windows 迁到 Linux 服务器:环境注意点
linux·服务器·windows·python
北凉军15 分钟前
更换PyCharm 任务栏图标
ide·python·pycharm
yijianace19 分钟前
Python爬虫实战:BooksToScrape 多线程爬取与图片下载
开发语言·爬虫·python
LadenKiller19 分钟前
期货多品种轮动标的池:天勤 query_quotes 筛品种写法
python·区块链
郑洁文21 分钟前
基于Python+回归分析的电子产品需求数据分析与预测
python·数据分析·回归·电子产品需求数据·电子产品数据分析
IT策士26 分钟前
Redis 从入门到精通:Python 操作 Redis 进阶
数据库·redis·python
质造者31 分钟前
Python 本地 RAG 实战 | Ollama+ChromaDB 实现 PDF 离线智能问答
开发语言·python·pdf·大模型·rag
骑士雄师32 分钟前
18.1 星系案例:多智能体宇宙探索系统(学习langgraph 的存储知识)
windows·python·学习
m沐沐38 分钟前
【深度学习】PyTorch CNN 手写数字识别(卷积神经网络)
人工智能·pytorch·python·深度学习·机器学习·pycharm·cnn
garmin Chen39 分钟前
Prompt工程入门:让AI按你的要求工作(3)--Prompt工程与提示词安全评测概述
java·人工智能·python·安全·prompt