流程被神秘地消耗了两次

huangapple go评论86阅读模式
英文:

Stream mysteriously consumed twice

问题

以下是您提供的代码的翻译部分:

以下代码最终会出现java.lang.IllegalStateException: stream has already been operated upon or closed错误。

  1. public static void main(String[] args) {
  2. Stream.concat(Stream.of("FOOBAR"),
  3. reverse(StreamSupport.stream(new File("FOO/BAR").toPath().spliterator(), true)
  4. .map(Path::toString)));
  5. }
  6. static <T> Stream<T> reverse(Stream<T> stream) {
  7. return stream.reduce(Stream.empty(),
  8. (Stream<T> a, T b) -> Stream.concat(Stream.of(b), a),
  9. (a, b) -> Stream.concat(b, a));
  10. }

明显的解决方法是使用StreamSupport.stream(…, false)生成非并行流,但我无法理解为什么不能并行运行。

英文:

The following code ends up with a java.lang.IllegalStateException: stream has already been operated upon or closed.

  1. public static void main(String[] args) {
  2. Stream.concat(Stream.of(&quot;FOOBAR&quot;),
  3. reverse(StreamSupport.stream(new File(&quot;FOO/BAR&quot;).toPath().spliterator(), true)
  4. .map(Path::toString)));
  5. }
  6. static &lt;T&gt; Stream&lt;T&gt; reverse(Stream&lt;T&gt; stream) {
  7. return stream.reduce(Stream.empty(),
  8. (Stream&lt;T&gt; a, T b) -&gt; Stream.concat(Stream.of(b), a),
  9. (a, b) -&gt; Stream.concat(b, a));
  10. }

The obvious solution is to generate a non parallel stream with StreamSupport.stream(…, false), but I can’t see why can’t run in parallel.

答案1

得分: 4

Stream.empty()不是一个常量。这个方法在每次调用时都会返回一个新的流实例,就像任何其他流一样被消耗,例如当你将它传递给Stream.concat

因此,Stream.empty()不适合作为reduce标识值,因为标识值可能会被传递给约定意义模糊的规约函数任意次数的输入。这是一个实现细节,它恰好在串行规约时仅被使用一次,而在并行规约时可能被使用多次。

你可以使用:

  1. static <T> Stream<T> reverse(Stream<T> stream) {
  2. return stream.map(Stream::of)
  3. .reduce((a, b) -> Stream.concat(b, a))
  4. .orElseGet(Stream::empty);
  5. }

代替。

然而,我只将这个解决方案作为学术练习提供。一旦流变得很大,它会导致大量的concat调用,并且文档中的注释适用:

在构造重复连接的流时要小心。访问深度连接流的元素可能导致深层调用链,甚至可能导致StackOverflowError

一般情况下,当以这种方式使用流API时,生成的底层数据结构会比平面列表昂贵得多。

你可以使用类似这样的代码:

  1. Stream<String> s = Stream.concat(Stream.of("FOOBAR"),
  2. reverse(new File("FOO/BAR").toPath()).map(Path::toString));
  1. static Stream<Path> reverse(Path p) {
  2. ArrayDeque<Path> d = new ArrayDeque<>();
  3. p.forEach(d::addFirst);
  4. return d.stream();
  5. }

或者

  1. static Stream<Path> reverse(Path p) {
  2. Stream.Builder b = Stream.builder();
  3. for(; p != null; p = p.getParent()) b.add(p.getFileName());
  4. return b.build();
  5. }

在Java 9+中,你可以使用一个真正没有额外存储的流(这不一定意味着它会更高效):

  1. static Stream<Path> reverse(Path p) {
  2. return Stream.iterate(p, Objects::nonNull, Path::getParent).map(Path::getFileName);
  3. }
英文:

Stream.empty() is not a constant. This method returns a new stream instance on each invocation that will get consumed like any other stream, e.g. when you pass it into Stream.concat.

Therefore, Stream.empty() is not suitable as identity value for reduce, as the identity value may get passed as input to the reduction function an arbitrary, intentionally unspecified number of times. It’s an implementation detail that is happens to be used only a single time for sequential reduction and potentially multiple times for parallel reduction.

You can use

  1. static &lt;T&gt; Stream&lt;T&gt; reverse(Stream&lt;T&gt; stream) {
  2. return stream.map(Stream::of)
  3. .reduce((a, b) -&gt; Stream.concat(b, a))
  4. .orElseGet(Stream::empty);
  5. }

instead.

However, I only provide the solution as an academic exercise. As soon as the stream gets large, it leads to an excessive amount of concat calls and the note of the documentation applies:

> Use caution when constructing streams from repeated concatenation. Accessing an element of a deeply concatenated stream can result in deep call chains, or even StackOverflowError.

Generally, the resulting underlying data structure will be far more expensive than a flat list, when using the Stream API this way.

You can use something like

  1. Stream&lt;String&gt; s = Stream.concat(Stream.of(&quot;FOOBAR&quot;),
  2. reverse(new File(&quot;FOO/BAR&quot;).toPath()).map(Path::toString));
  1. static Stream&lt;Path&gt; reverse(Path p) {
  2. ArrayDeque&lt;Path&gt; d = new ArrayDeque&lt;&gt;();
  3. p.forEach(d::addFirst);
  4. return d.stream();
  5. }

or

  1. static Stream&lt;Path&gt; reverse(Path p) {
  2. Stream.Builder b = Stream.builder();
  3. for(; p != null; p = p.getParent()) b.add(p.getFileName());
  4. return b.build();
  5. }

With Java 9+ you can use a stream that truly has no additional storage (which does not necessarily imply that it will be more efficient):

  1. static Stream&lt;Path&gt; reverse(Path p) {
  2. return Stream.iterate(p, Objects::nonNull, Path::getParent).map(Path::getFileName);
  3. }

huangapple
  • 本文由 发表于 2020年10月16日 23:16:48
  • 转载请务必保留本文链接:https://go.coder-hub.com/64391930.html
匿名

发表评论

匿名网友

:?: :razz: :sad: :evil: :!: :smile: :oops: :grin: :eek: :shock: :???: :cool: :lol: :mad: :twisted: :roll: :wink: :idea: :arrow: :neutral: :cry: :mrgreen:

确定