在你的运行中，ChatGPT 5 平均响应更快。你的每个挑战的计时数据已包含。

我需要审查它们生成的代码吗？

是的。两个模型都可能出现小错误；始终运行测试，并为 I/O 和网络代码添加保护措施。

哪个在处理文件、图像和网络时更健壮？

ChatGPT 5。它倾向于添加入口点、超时、流处理，以及更好的图像重采样。

Grok 4 有优势吗？

有，在你已了解上下文并希望最少输出时，它能生成更快捷简洁的脚本。

哪种提示风格效果最好？

明确说明输入/输出、库和边界情况（例如：“使用带超时的 requests，打印 JSON 解析错误”）。

我可以依赖其中任何一个模型来生成生产代码吗？

将它们作为加速器，而不是替代品：在流程中保留测试、代码检查和安全审查。

OpenAI ChatGPT 5 vs Grok 4 - 哪个能生成更好的Python代码？

Item: ChatGPT 5
Rating: 4.5
Author: Mecanik

说到 AI 辅助编码，OpenAI ChatGPT 5 和 Grok 4 是 2025 年最热门的两款工具。两者都声称能够以高精度和高效率编写 Python 代码，但在实际场景中，究竟哪一款能够真正交付更简洁、更易于维护的脚本呢？

在本文中，我们将在多项 Python 挑战赛中对 ChatGPT 5 和 Grok 4 进行对比。我们将考察它们的代码正确性、可读性、执行速度以及在应对棘手问题时的适应性。

为什么要比较 ChatGPT 5 和 Grok 4 的 Python 版本？

Python 仍然是 AI、自动化和数据科学领域的首选语言。现在，许多开发者依靠 AI 来加速他们的编码工作流程、更快地进行调试，甚至从头开始生成完整的应用程序。

虽然 ChatGPT 长期以来一直主导着这一领域，但 Grok 也因其代码清晰和输出简洁而备受关注。两者采用不同的训练理念：ChatGPT 注重推理，而 Grok 则注重简洁。

测试 OpenAI ChatGPT 5 和 Grok 4

我们将通过 10 个 Python 代码测试用例来比较这两个 AI：

基本脚本（偶数求和）
数据处理（CSV 总计）
算法问题（第一个非重复字符）
API 集成（比特币价格获取）
网页抓取（Hacker News 标题）
文件压缩（.txt 文件压缩）
图像处理（调整 JPEG 大小）
多线程（并发下载）
数据库交互（SQLite CRUD）
单元测试（素数测试）

所有测试题目将完全相同，并提交给 ChatGPT 5 和 Grok 4。您可以自行运行测试来验证结果。

Python 挑战 1 - 基础脚本

挑战 1 提示：

编写一个 Python 脚本，读取一个整数列表并输出偶数之和。

挑战 1 GPT 5 代码：

 1# Script to read a list of integers and output the sum of even numbers
 2def sum_of_even_numbers(numbers):
 3    return sum(num for num in numbers if num % 2 == 0)
 4
 5if __name__ == "__main__":
 6    # Read input from the user
 7    try:
 8        numbers = list(map(int, input("Enter integers separated by spaces: ").split()))
 9        total_even = sum_of_even_numbers(numbers)
10        print("Sum of even numbers:", total_even)
11    except ValueError:
12        print("Please enter valid integers only.")

挑战 1 Grok 4 代码：

 1# Read a line of input containing space-separated integers
 2input_line = input("Enter a list of integers separated by spaces: ")
 3
 4# Convert the input string into a list of integers
 5numbers = list(map(int, input_line.split()))
 6
 7# Initialize sum of even numbers
 8even_sum = 0
 9
10# Iterate through the list and add even numbers to the sum
11for num in numbers:
12    if num % 2 == 0:
13        even_sum += num
14
15# Output the sum
16print("Sum of even numbers:", even_sum)

挑战 1 观察：

GPT 5 速度：响应速度非常快，几乎即时完成。
GPT 5 代码：提供更短、更优雅的代码，包含入口点和注释。
Grok 4 速度：响应速度较慢，“思考”时间稍长（约 2 秒）。
Grok 4 代码：提供稍长的代码，没有入口点，但包含注释。

Python 挑战 2 - 数据处理

挑战 2 提示：

加载交易记录的 CSV 文件并打印每件产品的总销售额。

挑战 2 GPT 5 代码：

 1import csv
 2from collections import defaultdict
 3
 4def total_sales_per_product(csv_file):
 5    sales = defaultdict(float)
 6
 7    with open(csv_file, newline='', encoding='utf-8') as f:
 8        reader = csv.DictReader(f)
 9        for row in reader:
10            try:
11                product = row['product']
12                quantity = float(row['quantity'])
13                price = float(row['price'])
14                sales[product] += quantity * price
15            except (KeyError, ValueError):
16                print(f"Skipping invalid row: {row}")
17
18    return sales
19
20if __name__ == "__main__":
21    file_path = "transactions.csv"  # Change to your file path
22    sales_totals = total_sales_per_product(file_path)
23
24    print("Total Sales Per Product:")
25    for product, total in sales_totals.items():
26        print(f"{product}: ${total:,.2f}")

挑战 2 Grok 4 代码：

 1import pandas as pd
 2
 3# Prompt user for the CSV file path
 4file_path = input("Enter the path to the CSV file: ")
 5
 6# Load the CSV file into a DataFrame
 7df = pd.read_csv(file_path)
 8
 9# Assuming the CSV has columns 'Product' and 'Sales'
10# Group by 'Product' and sum the 'Sales'
11total_sales = df.groupby('Product')['Sales'].sum()
12
13# Print the total sales per product
14print("Total sales per product:")
15print(total_sales)

挑战 2 观察：

GPT 5 速度：响应速度非常快，几乎是即时的。
GPT 5 代码：提供了稍长且优雅的代码，包含入口点，但没有注释。
Grok 4 速度：响应速度较慢，“思考”起来稍长（约 13 秒）。
Grok 4 代码：提供了更简单的代码，没有入口点，但包含注释。

Python 挑战 3 - 算法题

挑战 3 提示：

给定一个字符串，返回第一个非重复字符。

挑战 3 GPT 5 代码：

 1def first_non_repeating_char(s):
 2    from collections import Counter
 3    counts = Counter(s)
 4
 5    for char in s:
 6        if counts[char] == 1:
 7            return char
 8    return None  # If no non-repeating character found
 9
10# Example usage
11text = "swiss"
12result = first_non_repeating_char(text)
13if Code:
14    print(f"The first non-repeating character is: '{result}'")
15else:
16    print("No non-repeating character found.")

挑战 3 Grok 4 代码：

 1import collections
 2
 3def first_non_repeating_char(s):
 4    """
 5    Returns the first non-repeating character in the string.
 6    If no such character exists, returns None.
 7    """
 8    count = collections.Counter(s)
 9    for char in s:
10        if count[char] == 1:
11            return char
12    return None
13
14# Example usage
15input_string = input("Enter a string: ")
16result = first_non_repeating_char(input_string)
17print("First non-repeating character:", result)

挑战 3 观察：

GPT 5 速度：响应速度非常快，几乎即时完成。
GPT 5 代码：提供的短代码没有入口点，也没有注释。示例用法存在错误。
Grok 4 速度：响应速度较慢，“思考”时间稍长（约 8 秒）。
Grok 4 代码：提供的短代码没有入口点，但带有简短的文档字符串。

Python 挑战 4 - API 集成

挑战 4 提示：

使用请求库从公共 API 获取当前比特币的美元价格并打印出来。

挑战 4 GPT 5 代码：

 1import requests
 2
 3def get_bitcoin_price_usd():
 4    url = "https://api.coindesk.com/v1/bpi/currentprice/USD.json"
 5    try:
 6        response = requests.get(url, timeout=5)
 7        response.raise_for_status()  # Raise HTTPError for bad responses
 8        data = response.json()
 9        price = data["bpi"]["USD"]["rate"]
10        print(f"Current Bitcoin Price: ${price} USD")
11    except requests.exceptions.RequestException as e:
12        print(f"Error fetching Bitcoin price: {e}")
13    except (KeyError, ValueError) as e:
14        print(f"Error parsing response: {e}")
15
16if __name__ == "__main__":
17    get_bitcoin_price_usd()

挑战 4 Grok 4 代码：

 1import requests
 2
 3# Define the API endpoint
 4url = "https://api.coingecko.com/api/v3/simple/price?ids=bitcoin&vs_currencies=usd"
 5
 6# Fetch the data
 7response = requests.get(url)
 8
 9# Check if the request was successful
10if response.status_code == 200:
11    data = response.json()
12    bitcoin_price = data['bitcoin']['usd']
13    print(f"Current Bitcoin price in USD: ${bitcoin_price}")
14else:
15    print(f"Error fetching data: {response.status_code}")

挑战 4 观察：

GPT 5 速度：响应速度非常快，几乎是即时的。
GPT 5 代码：提供简短优雅的代码，包含入口点，但没有注释。
Grok 4 速度：响应速度较慢，“思考”起来稍长一些（约 23 秒）。
Grok 4 代码：提供的代码略短，没有入口点，但包含注释。

Python 挑战 5 - 网页抓取

挑战 5 提示：

使用请求和 BeautifulSoup 从 https://news.ycombinator.com/ 抓取排名前 5 的新闻标题并打印出来。

挑战 5 GPT 5 代码：

 1import requests
 2from bs4 import BeautifulSoup
 3
 4def get_top_hn_headlines(limit=5):
 5    url = "https://news.ycombinator.com/"
 6    try:
 7        response = requests.get(url, timeout=5)
 8        response.raise_for_status()
 9        
10        soup = BeautifulSoup(response.text, "html.parser")
11        headlines = [a.get_text() for a in soup.select(".titleline > a")]
12        
13        print(f"Top {limit} Hacker News Headlines:")
14        for i, title in enumerate(headlines[:limit], 1):
15            print(f"{i}. {title}")
16    except requests.exceptions.RequestException as e:
17        print(f"Error fetching Hacker News: {e}")
18
19if __name__ == "__main__":
20    get_top_hn_headlines()

挑战 5 Grok 4 代码：

 1import requests
 2from bs4 import BeautifulSoup
 3
 4# Define the URL
 5url = 'https://news.ycombinator.com/'
 6
 7# Fetch the webpage
 8response = requests.get(url)
 9response.raise_for_status()  # Raise an error for bad status codes
10
11# Parse the HTML content
12soup = BeautifulSoup(response.text, 'html.parser')
13
14# Find the top 5 story elements
15stories = soup.find_all('tr', class_='athing')[:5]
16
17# Print the top 5 headlines
18print("Top 5 News Headlines from Hacker News:")
19for i, story in enumerate(stories, 1):
20    title_span = story.find('span', class_='titleline')
21    if title_span:
22        headline = title_span.find('a').text
23        print(f"{i}. {headline}")

挑战 5 观察：

GPT 5 速度：响应速度非常快，几乎是即时的。
GPT 5 代码：提供简短优雅的代码，包含入口点，但没有注释。
Grok 4 速度：响应速度较慢，“感觉”稍长一些（约 13 秒）。
Grok 4 代码：提供稍长的代码，没有入口点，但包含注释。

Python 挑战 6 - 文件压缩

挑战 6 提示：

创建一个 Python 脚本，将当前目录中的所有 .txt 文件压缩到“archive.zip”文件中。

挑战 6 GPT 5 代码：

 1import zipfile
 2import glob
 3import os
 4
 5def zip_text_files(output_zip="archive.zip"):
 6    txt_files = glob.glob("*.txt")
 7    if not txt_files:
 8        print("No .txt files found in the current directory.")
 9        return
10    
11    with zipfile.ZipFile(output_zip, "w") as zipf:
12        for file in txt_files:
13            zipf.write(file, os.path.basename(file))
14            print(f"Added: {file}")
15    
16    print(f"Created {output_zip} containing {len(txt_files)} files.")
17
18if __name__ == "__main__":
19    zip_text_files()

挑战 6 Grok 4 代码：

 1import os
 2from zipfile import ZipFile
 3
 4# Get the current directory
 5current_dir = os.getcwd()
 6
 7# List all .txt files in the current directory
 8txt_files = [f for f in os.listdir(current_dir) if f.endswith('.txt')]
 9
10# Create a zip file and add the .txt files
11with ZipFile('archive.zip', 'w') as zipf:
12    for file in txt_files:
13        zipf.write(file)
14
15print(f"Successfully zipped {len(txt_files)} .txt files into 'archive.zip'.")

挑战 6 观察：

GPT 5 速度：响应速度非常快，几乎即时完成。
GPT 5 代码：提供简短优雅的代码，包含入口点，但没有注释。
Grok 4 速度：响应速度较慢，“思考”时间稍长（约 10 秒）。
Grok 4 代码：提供简短的代码，没有入口点，但包含注释。

Python 挑战 7 - 图像处理

挑战 7 提示：

使用 Pillow (PIL) 将当前目录中的所有 JPEG 图像调整为 800x600 像素，并将其保存到“输出”文件夹。

挑战 7 GPT 5 代码：

 1import os
 2from PIL import Image
 3import glob
 4
 5def resize_jpeg_images(width=800, height=600, output_dir="output"):
 6    # Create output folder if it doesn't exist
 7    os.makedirs(output_dir, exist_ok=True)
 8
 9    # Find all .jpg and .jpeg files (case-insensitive)
10    jpeg_files = glob.glob("*.jpg") + glob.glob("*.jpeg") + glob.glob("*.JPG") + glob.glob("*.JPEG")
11
12    if not jpeg_files:
13        print("No JPEG images found in the current directory.")
14        return
15
16    for file in jpeg_files:
17        try:
18            with Image.open(file) as img:
19                img_resized = img.resize((width, height), Image.LANCZOS)
20                output_path = os.path.join(output_dir, os.path.basename(file))
21                img_resized.save(output_path, "JPEG")
22                print(f"Resized and saved: {output_path}")
23        except Exception as e:
24            print(f"Error processing {file}: {e}")
25
26if __name__ == "__main__":
27    resize_jpeg_images()

挑战 7 Grok 4 代码：

 1import os
 2from PIL import Image
 3
 4# Get the current directory
 5current_dir = os.getcwd()
 6
 7# Create the output folder if it doesn't exist
 8output_folder = os.path.join(current_dir, 'output')
 9os.makedirs(output_folder, exist_ok=True)
10
11# List all JPEG files in the current directory (case-insensitive)
12jpeg_files = [f for f in os.listdir(current_dir) 
13              if f.lower().endswith(('.jpg', '.jpeg'))]
14
15# Resize each image and save to output folder
16resized_count = 0
17for file in jpeg_files:
18    try:
19        img_path = os.path.join(current_dir, file)
20        img = Image.open(img_path)
21        resized_img = img.resize((800, 600))
22        output_path = os.path.join(output_folder, file)
23        resized_img.save(output_path)
24        resized_count += 1
25    except Exception as e:
26        print(f"Error processing {file}: {e}")
27
28print(f"Successfully resized and saved {resized_count} JPEG images to the 'output' folder.")

挑战 7 观察：

GPT 5 速度：响应速度非常快，几乎是即时的。
GPT 5 代码：提供简短优雅的代码，包含入口点和注释。
Grok 4 速度：响应速度较慢，“思考”起来稍长一些（约 10 秒）。
Grok 4 代码：提供稍长的代码，没有入口点，但包含注释。

Python 挑战 8 - 多线程

挑战 8 提示：

创建一个 Python 脚本，使用线程技术从给定的 URL 同时下载 5 个不同的文件。

挑战 8 GPT 5 代码：

 1import os
 2from pathlib import Path
 3from urllib.parse import urlparse
 4from concurrent.futures import ThreadPoolExecutor, as_completed
 5
 6import requests
 7
 8CHUNK_SIZE = 1024 * 64  # 64KB
 9
10
11def unique_path(dest_dir: Path, filename: str) -> Path:
12    """Return a unique path inside dest_dir by appending (1), (2), ... if needed."""
13    dest = dest_dir / filename
14    if not dest.exists():
15        return dest
16    stem, suffix = os.path.splitext(filename)
17    i = 1
18    while True:
19        candidate = dest_dir / f"{stem} ({i}){suffix}"
20        if not candidate.exists():
21            return candidate
22        i += 1
23
24
25def filename_from_url(url: str, index: int) -> str:
26    """Get a safe filename from URL; fall back to file_{index} if missing."""
27    path = urlparse(url).path
28    name = os.path.basename(path).strip()
29    return name or f"file_{index}"
30
31
32def download_file(url: str, dest_dir: Path, index: int, timeout: int = 15) -> Path:
33    """Download a single file to dest_dir. Returns the saved path."""
34    dest_dir.mkdir(parents=True, exist_ok=True)
35    base_name = filename_from_url(url, index)
36    dest_path = unique_path(dest_dir, base_name)
37
38    headers = {"User-Agent": "python-threaded-downloader/1.0"}
39    with requests.get(url, stream=True, timeout=timeout, headers=headers) as r:
40        r.raise_for_status()
41        with open(dest_path, "wb") as f:
42            for chunk in r.iter_content(chunk_size=CHUNK_SIZE):
43                if chunk:  # filter out keep-alive chunks
44                    f.write(chunk)
45    return dest_path
46
47
48def download_concurrently(urls, out_dir="downloads", max_workers=5):
49    """Download given URLs concurrently using threads."""
50    dest_dir = Path(out_dir)
51    results = []
52    errors = []
53
54    with ThreadPoolExecutor(max_workers=max_workers) as executor:
55        future_to_url = {
56            executor.submit(download_file, url, dest_dir, i): url
57            for i, url in enumerate(urls, start=1)
58        }
59        for future in as_completed(future_to_url):
60            url = future_to_url[future]
61            try:
62                saved_path = future.result()
63                print(f"✅ Downloaded: {url} -> {saved_path}")
64                results.append(saved_path)
65            except Exception as e:
66                print(f"❌ Failed: {url} ({e})")
67                errors.append((url, e))
68    print(f"\nDone. {len(results)} succeeded, {len(errors)} failed.")
69    return results, errors
70
71
72if __name__ == "__main__":
73    # Replace these with your 5 URLs
74    urls = [
75        "https://example.com/file1.jpg",
76        "https://example.com/file2.pdf",
77        "https://example.com/file3.zip",
78        "https://example.com/file4.png",
79        "https://example.com/file5.mp4",
80    ]
81    download_concurrently(urls, out_dir="downloads", max_workers=5)

挑战 8 Grok 4 代码：

 1import threading
 2import urllib.request
 3
 4def download_file(url, filename):
 5    """
 6    Downloads a file from the given URL and saves it with the specified filename.
 7    """
 8    try:
 9        urllib.request.urlretrieve(url, filename)
10        print(f"Downloaded {filename} from {url}")
11    except Exception as e:
12        print(f"Error downloading {url}: {e}")
13
14# List of 5 example URLs (replace with your actual URLs)
15urls = [
16    "https://www.example.com/file1.txt",
17    "https://www.example.com/file2.pdf",
18    "https://www.example.com/file3.jpg",
19    "https://www.example.com/file4.zip",
20    "https://www.example.com/file5.docx"
21]
22
23# Corresponding filenames to save the downloads
24filenames = [
25    "file1.txt",
26    "file2.pdf",
27    "file3.jpg",
28    "file4.zip",
29    "file5.docx"
30]
31
32# Create and start threads for concurrent downloads
33threads = []
34for url, filename in zip(urls, filenames):
35    thread = threading.Thread(target=download_file, args=(url, filename))
36    thread.start()
37    threads.append(thread)
38
39# Wait for all threads to complete
40for thread in threads:
41    thread.join()
42
43print("All downloads completed.")

挑战 8 观察：

GPT 5 速度：响应速度较慢，“思考”时间稍长（约 10 秒）。
GPT 5 代码：提供包含入口点和注释的长而优雅的代码。
Grok 4 速度：响应速度更慢，“思考”时间稍长（约 21 秒）。
Grok 4 代码：提供更短的代码，不包含入口点，但包含注释。

Python 挑战 9 - 数据库交互

挑战 9 提示：

连接到 SQLite 数据库，创建“users”表，插入示例数据，并查询所有用户。

挑战 9 GPT 5 代码：

 1import sqlite3
 2
 3def main():
 4    # Connect to (or create) the database
 5    conn = sqlite3.connect("example.db")
 6    cursor = conn.cursor()
 7
 8    # Create the users table if it doesn't exist
 9    cursor.execute("""
10        CREATE TABLE IF NOT EXISTS users (
11            id INTEGER PRIMARY KEY AUTOINCREMENT,
12            name TEXT NOT NULL,
13            email TEXT UNIQUE NOT NULL
14        )
15    """)
16
17    # Insert sample data
18    sample_users = [
19        ("Alice", "[email protected]"),
20        ("Bob", "[email protected]"),
21        ("Charlie", "[email protected]"),
22    ]
23
24    try:
25        cursor.executemany("INSERT INTO users (name, email) VALUES (?, ?)", sample_users)
26    except sqlite3.IntegrityError:
27        # Ignore duplicates if run multiple times
28        pass
29
30    # Commit changes
31    conn.commit()
32
33    # Query all users
34    cursor.execute("SELECT id, name, email FROM users")
35    users = cursor.fetchall()
36
37    print("Users in database:")
38    for user in users:
39        print(user)
40
41    # Close the connection
42    conn.close()
43
44if __name__ == "__main__":
45    main()

Challenge 9 Grok 4 Code:

 1import sqlite3
 2
 3# Connect to an in-memory SQLite database
 4conn = sqlite3.connect(':memory:')
 5cursor = conn.cursor()
 6
 7# Create the 'users' table
 8cursor.execute('''
 9    CREATE TABLE users (
10        id INTEGER PRIMARY KEY,
11        name TEXT,
12        age INTEGER
13    )
14''')
15
16# Insert sample data
17cursor.execute("INSERT INTO users (name, age) VALUES ('Alice', 30)")
18cursor.execute("INSERT INTO users (name, age) VALUES ('Bob', 25)")
19cursor.execute("INSERT INTO users (name, age) VALUES ('Charlie', 35)")
20
21# Commit the changes
22conn.commit()
23
24# Query all users
25cursor.execute("SELECT * FROM users")
26results = cursor.fetchall()
27
28# Print the results
29print("ID | Name | Age")
30print("----------------")
31for row in results:
32    print(f"{row[0]} | {row[1]} | {row[2]}")
33
34# Close the connection
35conn.close()

挑战 9 观察：

GPT 5 速度：响应速度非常快，几乎是即时的。即使我们只是直截了当地说“连接……”而不是“写入或创建”，我们也理解需要脚本。
GPT 5 代码：提供了长而优雅的代码，包含入口点和注释。
Grok 4 速度：响应速度较慢，“思考”时间稍长（约 16 秒）。
Grok 4 代码：提供了略短的代码，并且首次提供了入口点和注释。有趣的是，一些函数/测试名称与 ChatGPT 5 的风格相似。

Python 挑战 10 - 单元测试

挑战 10 提示：

编写一个 Python 单元测试，验证函数“is_prime(n)”是否正确识别素数。

挑战 10 GPT 5 代码：

 1import unittest
 2
 3# Example implementation of is_prime for testing purposes
 4# If you already have this function, you can remove this definition.
 5def is_prime(n):
 6    """Return True if n is a prime number, else False."""
 7    if n <= 1:
 8        return False
 9    if n <= 3:
10        return True
11    if n % 2 == 0 or n % 3 == 0:
12        return False
13    i = 5
14    while i * i <= n:
15        if n % i == 0 or n % (i + 2) == 0:
16            return False
17        i += 6
18    return True
19
20
21class TestIsPrime(unittest.TestCase):
22    def test_primes(self):
23        primes = [2, 3, 5, 7, 11, 13, 17, 19, 23, 29]
24        for p in primes:
25            with self.subTest(p=p):
26                self.assertTrue(is_prime(p), f"{p} should be prime")
27
28    def test_non_primes(self):
29        non_primes = [0, 1, 4, 6, 8, 9, 10, 12, 15, 20, 25]
30        for n in non_primes:
31            with self.subTest(n=n):
32                self.assertFalse(is_prime(n), f"{n} should not be prime")
33
34if __name__ == "__main__":
35    unittest.main()

挑战 10 Grok 4 代码：

 1import unittest
 2
 3# Assuming the is_prime function is defined in a module named prime_checker
 4# Replace 'prime_checker' with the actual module name where is_prime is defined
 5from prime_checker import is_prime
 6
 7class TestIsPrime(unittest.TestCase):
 8    def test_prime_numbers(self):
 9        """Test that prime numbers are correctly identified."""
10        primes = [2, 3, 5, 7, 11, 13, 17, 19, 23, 29]
11        for num in primes:
12            with self.subTest(num=num):
13                self.assertTrue(is_prime(num), f"{num} should be prime")
14
15    def test_non_prime_numbers(self):
16        """Test that non-prime numbers are correctly identified."""
17        non_primes = [0, 1, 4, 6, 8, 9, 10, 12, 14, 15, 16, 18, 20, 21, 22, 24, 25, 26, 27, 28]
18        for num in non_primes:
19            with self.subTest(num=num):
20                self.assertFalse(is_prime(num), f"{num} should not be prime")
21
22    def test_negative_numbers(self):
23        """Test that negative numbers are not considered prime."""
24        negatives = [-1, -2, -3, -5, -7]
25        for num in negatives:
26            with self.subTest(num=num):
27                self.assertFalse(is_prime(num), f"{num} should not be prime")
28
29    def test_one_and_zero(self):
30        """Explicitly test 0 and 1, which are not prime."""
31        self.assertFalse(is_prime(0))
32        self.assertFalse(is_prime(1))
33
34if __name__ == '__main__':
35    unittest.main()

挑战 10 的观察结果

GPT 5 速度：响应速度非常快，几乎是即时的。
GPT 5 代码：提供长而优雅的代码，包含入口点和注释。
Grok 4 速度：响应速度较慢，“思考”时间稍长（约 12 秒）。
Grok 4 代码：提供的代码略短，并且首次包含入口点和注释。有趣的是，函数名称与 ChatGPT 5 相同。

评估标准

我们将根据以下标准对每个 AI 进行评分：

代码正确性：无需进行重大修改即可运行吗？
可读性：代码是否简洁且注释清晰？
效率：是否使用了最佳方法？
错误处理：是否能够预测可能的故障？
可解释性：是否提供了清晰的推理？

初步观察

根据以往和当前的经验：

ChatGPT 5 倾向于提供更冗长、文档更完善的代码，速度也更快。
Grok 4 更倾向于极简主义，响应速度较慢，有时会省略注释。

两者都擅长标准任务，但 Grok 在处理多步骤推理提示时可能会遇到困难。

10 项挑战的突出之处

提示执行情况：GPT-5 始终专注于任务（例如，BTC API）。Grok 4 偶尔会出现偏差（挑战 4）。
入口点和结构：GPT-5 始终使用入口点和辅助函数；Grok 4 经常编写没有入口点的单文件脚本。
错误处理：GPT-5 更频繁地添加超时/raise_for_status/try-except；Grok 4 则倾向于极简主义。
依赖关系与假设：GPT-5 尽可能使用 stdlib；Grok 4 则依赖于 Pandas 或更简单的 urllib 默认库。
数据模型假设：GPT-5 推断字段和计算值（数量 × 价格）；Grok 4 假设列已预先聚合。
算法谨慎：两者都解决了逻辑任务；GPT-5 的示例存在一个小的变量错误，而 Grok 4 的 HN 示例存在语法错误。
性能态势：GPT-5 使用流式传输 + 线程池进行下载；Grok 4 使用原始线程 + urlretrieve（更简单，但鲁棒性较差）。

结论

虽然这两种工具都可以编写函数式 Python 代码，但最终选择可能取决于开发者的偏好：

如果您注重详细的解释、循序渐进的推理、快速的代码生成和丰富的注释，请选择ChatGPT 5。
如果您更喜欢简洁、简单的代码，尽量减少冗余代码，并且代码生成速度较慢，请选择Grok 4。

说实话，我更喜欢ChatGPT 5，因为它的响应速度更快，并且能够生成更好、更详细的Python代码。抱歉了，埃隆·马斯克。

常见问题解答 (FAQ)

OpenAI ChatGPT 5 和 Grok 4 哪个更适合初学者？ ChatGPT 5。它解释得更详细，包含更安全的默认值（超时、错误处理），并且结构更简洁。

在这些测试中，哪个产生的代码问题更少？ ChatGPT 5 总体而言。Grok 4 在抓取数据时偶尔会出现提示漂移和轻微的语法错误。

哪个更快？ 在您的运行中，ChatGPT 5 的平均响应速度更快。您的计时已包含在每次挑战中。

我需要检查它们生成的代码吗？ 是的。两种模型都可能犯一些小错误；请务必运行测试并为 I/O 和网络代码添加防护措施。

哪个处理文件、图像和网络更稳健？ ChatGPT 5。它倾向于添加入口点、超时、流式传输和更好的图像重采样。

Grok 4 有什么优势吗？ 是的，当您已经了解上下文并希望最小化输出时，Grok 4 的脚本更简洁、更快捷。

哪种提示样式效果最好？ 明确说明输入/输出、库和边缘情况（例如：“使用带超时的请求，打印 JSON 解析错误”）。

我可以依赖这两种模型来编写生产代码吗？ 将它们用作加速器，而不是替代品：将测试、linting 和安全审查保留在您的流程中。

OpenAI ChatGPT 5 vs Grok 4 - 哪个能生成更好的Python代码？

为什么要比较 ChatGPT 5 和 Grok 4 的 Python 版本？

测试 OpenAI ChatGPT 5 和 Grok 4

Python 挑战 1 - 基础脚本

Python 挑战 2 - 数据处理

Python 挑战 3 - 算法题

Python 挑战 4 - API 集成

Python 挑战 5 - 网页抓取

Python 挑战 6 - 文件压缩

Python 挑战 7 - 图像处理

Python 挑战 8 - 多线程

Python 挑战 9 - 数据库交互

Python 挑战 10 - 单元测试

评估标准

初步观察

10 项挑战的突出之处

结论

常见问题解答 (FAQ)

评论

发表评论

OpenAI ChatGPT 5 vs Grok 4 - 哪个能生成更好的Python代码？

为什么要比较 ChatGPT 5 和 Grok 4 的 Python 版本？

测试 OpenAI ChatGPT 5 和 Grok 4

Python 挑战 1 - 基础脚本

Python 挑战 2 - 数据处理

Python 挑战 3 - 算法题

Python 挑战 4 - API 集成

Python 挑战 5 - 网页抓取

Python 挑战 6 - 文件压缩

Python 挑战 7 - 图像处理

Python 挑战 8 - 多线程

Python 挑战 9 - 数据库交互

Python 挑战 10 - 单元测试

评估标准

初步观察

10 项挑战的突出之处

结论

常见问题解答 (FAQ)

相关文章

评论

发表评论 取消回复

发表评论