英文:
Can a python function be both a generator and a "non-generator"?
问题
我有一个函数,我想要从中生成字节(生成器行为),并根据`save`布尔值是否设置来写入文件(非生成器行为)。这是否可能?
```python
def encode_file(source, save=False, destination=None):
# 每次处理输入文件的内容3字节
print('hello')
with open(source, 'rb') as infile:
# 将字节保存到目标文件
if save:
print(f'saving to file {destination}')
with open(destination, 'wb') as outfile:
while (bytes_to_encode := infile.read(3)):
l = len(bytes_to_encode)
if l < 3:
bytes_to_encode += (b'\x00' * (3 - l))
outfile.write(bytes_to_encode)
return
# 向调用者生成字节
else:
while (bytes_to_encode := infile.read(3)):
l = len(bytes_to_encode)
if l < 3:
bytes_to_encode += (b'\x00' * (3 - l)) # 如果不足3字节则填充位
yield encode(bytes_to_encode)
return
在上面的实现中,该函数始终表现为生成器。当我调用
encode_file('file.bin', save=True, destination='output.base64')
它不会打印“hello”,而是返回一个生成器对象。这对我来说没有意义。不应该打印“hello”,然后将控制传递到代码的if save:
部分,从而避免完全生成字节的部分吗?
<details>
<summary>英文:</summary>
I have a function which I want to yield bytes from (generator behaviour) and also write to a file (non-generator behaviour) depending on whether the `save` boolean is set. Is that possible?
```python
def encode_file(source, save=False, destination=None):
# encode the contents of an input file 3 bytes at a time
print('hello')
with open(source, 'rb') as infile:
# save bytes to destination file
if save:
print(f'saving to file {destination}')
with open(destination, 'wb') as outfile:
while (bytes_to_encode := infile.read(3)):
l = len(bytes_to_encode)
if l < 3:
bytes_to_encode += (b'\x00' * (3 - l))
outfile.write(bytes_to_encode)
return
# yield bytes to caller
else:
while (bytes_to_encode := infile.read(3)):
l = len(bytes_to_encode)
if l < 3:
bytes_to_encode += (b'\x00' * (3 - l)) # pad bits if short
yield encode(bytes_to_encode)
return
In the above implementation, the function always behaves as a generator. When I call
encode_file('file.bin', save=True, destination='output.base64')
it does not print "hello" instead, it returns a generator object. This does not make sense to me. Shouldn't "hello" be printed and then shouldn't control be directed to the if save:
portion of the code thus avoiding the part of the function that yields completely?
答案1
得分: 3
一个函数既不能是生成器,也不能不是生成器,但当然你可以通过定义一个辅助函数来决定是否返回一个生成器对象。为了避免在两者之间重复(读取)使用with
(并减少冗余),将一个分支作为另一个的客户端:
def encode_file(source, save=False, destination=None):
# 每次处理输入文件的内容3字节
print('hello')
# 将字节保存到目标文件
if save:
print(f'保存到文件 {destination}')
with open(destination, 'wb') as outfile:
for bytes_to_encode in encode_file(source):
outfile.write(bytes_to_encode)
# 将字节提供给调用者
else:
def g():
with open(source, 'rb') as infile:
while (bytes_to_encode := infile.read(3)):
l = len(bytes_to_encode)
if l < 3:
bytes_to_encode += (b'\x00' * (3 - l)) # 如果字节不足,填充位
yield encode(bytes_to_encode)
return g()
(感谢interjay指出了在g
中需要使用with
的需要。)
英文:
A function can’t be a generator and also not be one, but of course you can decide whether to return a generator object or not by defining a helper function. To avoid duplicating the (read) with
between the two (and reduce redundancy in general), make one branch a client of the other:
def encode_file(source, save=False, destination=None):
# encode the contents of an input file 3 bytes at a time
print('hello')
# save bytes to destination file
if save:
print(f'saving to file {destination}')
with open(destination, 'wb') as outfile:
for bytes_to_encode in encode_file(source):
outfile.write(bytes_to_encode)
# yield bytes to caller
else:
def g():
with open(source, 'rb') as infile:
while (bytes_to_encode := infile.read(3)):
l = len(bytes_to_encode)
if l < 3:
bytes_to_encode += (b'\x00' * (3 - l)) # pad bits if short
yield encode(bytes_to_encode)
return g()
(Thanks to interjay for pointing out the need for the with
in g
.)
答案2
得分: 2
你不能即时创建一个函数生成器,但你可以将逻辑提取到一个生成器函数中,然后创建一个单独的函数,该函数要么返回该生成器,要么将其保存到文件中。以下是一个用于演示目的的简单示例,带有注释:
# 作为生成器的示例函数
# 在这里放入所有的逻辑
def getNums_gen(n):
for i in range(n):
yield i
def getNums(n, asList=False):
gen = getNums_gen(n)
# 如果不想保存它,只需返回生成器
if not asList:
return gen
# 如果想保存它,循环遍历生成器并执行保存数据所需的任何操作。
# 在此示例中,我们将其保存为列表,但也可以写入文件
lst = [num for num in gen]
return lst
print(getNums(5, asList=True))
print(getNums(5, asList=False))
英文:
You can't make a function generator on the fly, but you can extract the logic into a generator function and then create a separate function that will either return that generator or save it to a file. Here's a simpler example for demonstration purposess with comments:
# example function as a generator
# put all your logic here
def getNums_gen(n):
for i in range(n):
yield i
def getNums(n, asList=False):
gen = getNums_gen(n)
# if we don't want to save it, just return the generator
if not asList:
return gen
# if we want to save it, loop through the generator and do whatever
# is needed to save the data. in this example, we'll just save it as a
# list, but it could be written to a file too
lst = [num for num in gen]
return lst
print(getNums(5, asList=True))
print(getNums(5, asList=False))
通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库,让每个人都能够通过互相帮助和分享经验来进步。
评论