In the last post in this series, we took a look at the new convenience factory methods for collections in Java 9. Collections play well with the Stream API, which learned a few new tricks since its introduction in Java 8 as well. There are 4 particularly exciting new features – let’s explore them with JShell!

1. dropWhile and takeWhile

Let’s take a step back and look at what was available prior to Java 9. We had an API for making streams containing elements. For example, let’s create a stream of numbers 0 to 9:

jshell> IntStream.range(0,10).forEach(System.out::println)
0
1
2
3
4
5
6
7
8
9

Since Java 8, we’ve known limit(long n) and its complementary method skip(long n), which allow us to take and ignore the first n elements of a stream, respectively. For instance:

jshell> IntStream.range(0,10).limit(5).forEach(System.out::println)
0
1
2
3
4
jshell> IntStream.range(0,10).skip(5).forEach(System.out::println)
5
6
7
8
9

These methods are very useful. However, they require us to know the number of elements to limit or skip in advance. Often, we know we want to keep taking or dropping elements while a certain condition is met, but we don’t necessarily know how many of these element we get. That’s exactly the functionality added in Java 9, courtesy of takeWhile and dropWhile methods:

jshell> IntStream.range(0,10).takeWhile(x -> x < 5).forEach(System.out::println)
0
1
2
3
4
jshell> IntStream.range(0,10).dropWhile(x -> x < 5).forEach(System.out::println)
5
6
7
8
9

takeWhile and dropWhile are complementary to each other. They’re basically versions of limit and skip which take a predicate instead of a number.

2. iterate

Another useful addition to the Stream API is the method iterate. The method is overloaded:

jshell> IntStream.iterate(
Signatures:
IntStream IntStream.iterate(int seed, IntUnaryOperator f)
IntStream IntStream.iterate(int seed, IntPredicate hasNext, IntUnaryOperator next)
<press tab again to see documentation>

The version on line 3 was available in JDK 8, whereas the version on line 4 was added in JDK 9. To illustrate the difference, let’s try and solve a simple problem – suppose we want to print all even numbers less than 10.

A naive approach to solving this problem using Java 8 constructs could involve iteration with a filtering step, such as:

jshell> IntStream.iterate(0, i -> i + 2).filter(j -> j < 10).forEach(System.out::println)
0
2
4
6
8
-2147483648
-2147483646
-2147483644
-2147483642
-2147483640
-2147483638
-2147483636
-2147483634
-2147483632
...

As we can see, if we try to run this simple and seemingly straightforward code, we discover it doesn’t work. At first, the correct solution is printed, then the output pauses, and continues to print incorrect numbers after a while.

It’s easy to see what’s happening here. We construct the right filter, but apply it to an infinite stream. After printing the elements matching the filtering condition, the stream silently keeps going. Our integers overflow and start matching the filtering condition again.

Unfortunately, there isn’t a concise and clean way to deal with this situation in JDK 8, which is what the recent release of the JDK fixes. We can now correctly solve our problem as follows:

jshell> IntStream.iterate(0, i -> i < 10, i -> i + 2).forEach(System.out::println)
0
2
4
6
8

If we look at the solution, we can see this version of iterate reads like a for loop, and that’s essentially what it is – a streamified version of for.

3. ofNullable

Another addition to the Stream API worth talking about is the method ofNullable. Prior to Java 9, we could create streams of arbitrary elements using the factory methods. For example:

jshell> Stream.of(1)
$1 ==> java.util.stream.ReferencePipeline$Head@4566e5bd

The result is a stream containing a single element – 1. We couldn’t do the same thing with null though:

jshell> Stream.of(null)
| Warning:
| non-varargs call of varargs method with inexact argument type for last parameter;
| cast to java.lang.Object for a varargs call
| cast to java.lang.Object[] for a non-varargs call and to suppress this warning
| Stream.of(null)
| ^--^
| java.lang.NullPointerException thrown:
| at Arrays.stream (Arrays.java:5610)
| at Stream.of (Stream.java:1187)
| at (#2:1)

This makes sense – we don’t want to have nulls in our streams. Java 9, however, allows us to do the following:

jshell> Stream.ofNullable(null)
$3 ==> java.util.stream.ReferencePipeline$Head@72b6cbcc

The result is not a stream containing a null element, but an empty stream:

jshell> Stream.ofNullable(null).count()
$4 ==> 0

If you’ve worked with streams a lot, you probably immediately realize why this is an exciting feature. It makes integration better, and allows us to construct cleaner streams. Often, when using long streams with many chained operations, particularly maps, we end up with nulls at some point in our stream. Since we don’t want to have nulls in streams, or NullPointerExceptions, we need to filter the null elements out. In practice, this involves a step with a ternary operator or an if statement, none of which are nice. ofNullable enables us to avoid these null checks, which is great!

4. Collectors.filtering and Collectors.flatMapping

Collectors are an essential part of the Stream API. They provide reduction operations which process stream elements into result containers, optionally performing a transformation on the aggregated results, after all input elements have been processed.

Java 9 added 2 new collectors, filtering and flatMapping, which play nice with the groupingBy collector.

Collectors.filtering

Similarly to the filter method on streams, the filtering collector is used for filtering elements in a stream. The filter method, however, processes the values before they’re grouped, whereas Collectors.filtering can be used nicely with Collectors.groupingBy to group the values before the filtering step takes place.

Let’s demonstrate the difference on an example. Suppose we’re given a list of numbers, e.g.:

jshell> List<Integer> numbers = List.of(2, 3, 4, 7, 9, 11)
numbers ==> [2, 3, 4, 7, 9, 11]

Now suppose we need to count the numbers by parity, but we’re only interested in the numbers greater than 5. Prior to Java 9, we would accomplish this as follows:

jshell> numbers.stream().filter(j -> j > 5).collect(Collectors.groupingBy(i -> i % 2, Collectors.counting()))
$2 ==> {1=3}

The result correctly says that we have 3 odd numbers greater than 5 in our list. However, it doesn’t tell us we don’t have any even numbers there. The even numbers were filtered out before aggregation, and any trace of them is lost. That’s exactly what the filtering collector addresses. In Java 9, we can count the numbers in both bags as follows:

jshell> numbers.stream().collect(Collectors.groupingBy(i -> i % 2, Collectors.filtering(j -> j > 5, Collectors.counting())))
$3 ==> {0=0, 1=3}

As we can see now, there are clearly no even and 3 odd numbers matching our condition.

Collectors.flatMapping

Collectors.flatMapping is to Collectors.mapping what Stream.flatmap is to Stream.map. Like the flatmap operation, the flatMapping collector allows us to handle nested collections (collections in streams) better, in this case on the collectors’ side. Like Collectors.mapping, Collectors.flatMapping takes a function to be applied to the input elements, and a collector to accumulate the elements passed through the function. Unlike Collectors.mapping, however, Collectors.flatMapping deals with a stream of elements, which allows us to get rid of often unnecessary intermediary collections.

Consider a stream of some data entries, for example integers associated with collections of strings, such as the following:

jshell> Stream<Map.Entry<Integer,Set<String>>> entries = Stream.of(Map.entry(1, Set.of("a", "b")), Map.entry(1, Set.of("a", "c")), Map.entry(2, Set.of("d")))
entries ==> java.util.stream.ReferencePipeline$Head@69379752

Now suppose we want to aggregate this data, e.g. group the strings by the integers they’re associated with. Using Java 8 constructs, we would accomplish this as follows:

jshell> entries.collect(Collectors.groupingBy(e -> e.getKey(), Collectors.mapping(e -> e.getValue(), Collectors.toSet())))
$1 ==> {1=[[b, a], [c, a]], 2=[[d]]}

Although this is technically correct, we end up with nested collections, which we would need to further unwrap to, for example, deal with duplicates. Fortunately, that’s exactly what the new flatMapping collector is for:

jshell> entries.collect(Collectors.groupingBy(e -> e.getKey(), Collectors.flatMapping(e -> e.getValue().stream(), Collectors.toSet())))
$2 ==> {1=[a, b, c], 2=[d]}

To sum up, the updates to the Stream API introduced in Java 9 are not huge. They do, however, tie up some loose ends, and ultimately allow us to write cleaner code, which is always exciting!