In the last post in this series, we took a look at the new convenience factory methods for collections in Java 9. Collections play well with the Stream API, which learned a few new tricks since its introduction in Java 8 as well. There are 4 particularly exciting new features – let’s explore them with JShell!
1. dropWhile
and takeWhile
Let’s take a step back and look at what was available prior to Java 9. We had an API for making streams containing elements. For example, let’s create a stream of numbers 0 to 9:
jshell> IntStream.range(0,10).forEach(System.out::println) | |
0 | |
1 | |
2 | |
3 | |
4 | |
5 | |
6 | |
7 | |
8 | |
9 |
Since Java 8, we’ve known limit(long n)
and its complementary method skip(long n)
, which allow us to take and ignore the first n
elements of a stream, respectively. For instance:
jshell> IntStream.range(0,10).limit(5).forEach(System.out::println) | |
0 | |
1 | |
2 | |
3 | |
4 | |
jshell> IntStream.range(0,10).skip(5).forEach(System.out::println) | |
5 | |
6 | |
7 | |
8 | |
9 |
These methods are very useful. However, they require us to know the number of elements to limit or skip in advance. Often, we know we want to keep taking or dropping elements while a certain condition is met, but we don’t necessarily know how many of these element we get. That’s exactly the functionality added in Java 9, courtesy of takeWhile
and dropWhile
methods:
jshell> IntStream.range(0,10).takeWhile(x -> x < 5).forEach(System.out::println) | |
0 | |
1 | |
2 | |
3 | |
4 | |
jshell> IntStream.range(0,10).dropWhile(x -> x < 5).forEach(System.out::println) | |
5 | |
6 | |
7 | |
8 | |
9 |
takeWhile
and dropWhile
are complementary to each other. They’re basically versions of limit
and skip
which take a predicate instead of a number.
2. iterate
Another useful addition to the Stream API is the method iterate
. The method is overloaded:
jshell> IntStream.iterate( | |
Signatures: | |
IntStream IntStream.iterate(int seed, IntUnaryOperator f) | |
IntStream IntStream.iterate(int seed, IntPredicate hasNext, IntUnaryOperator next) | |
<press tab again to see documentation> |
The version on line 3 was available in JDK 8, whereas the version on line 4 was added in JDK 9. To illustrate the difference, let’s try and solve a simple problem – suppose we want to print all even numbers less than 10.
A naive approach to solving this problem using Java 8 constructs could involve iteration with a filtering step, such as:
jshell> IntStream.iterate(0, i -> i + 2).filter(j -> j < 10).forEach(System.out::println) | |
0 | |
2 | |
4 | |
6 | |
8 | |
-2147483648 | |
-2147483646 | |
-2147483644 | |
-2147483642 | |
-2147483640 | |
-2147483638 | |
-2147483636 | |
-2147483634 | |
-2147483632 | |
... |
As we can see, if we try to run this simple and seemingly straightforward code, we discover it doesn’t work. At first, the correct solution is printed, then the output pauses, and continues to print incorrect numbers after a while.
It’s easy to see what’s happening here. We construct the right filter, but apply it to an infinite stream. After printing the elements matching the filtering condition, the stream silently keeps going. Our integers overflow and start matching the filtering condition again.
Unfortunately, there isn’t a concise and clean way to deal with this situation in JDK 8, which is what the recent release of the JDK fixes. We can now correctly solve our problem as follows:
jshell> IntStream.iterate(0, i -> i < 10, i -> i + 2).forEach(System.out::println) | |
0 | |
2 | |
4 | |
6 | |
8 |
If we look at the solution, we can see this version of iterate
reads like a for
loop, and that’s essentially what it is – a streamified version of for
.
3. ofNullable
Another addition to the Stream API worth talking about is the method ofNullable
. Prior to Java 9, we could create streams of arbitrary elements using the factory methods. For example:
jshell> Stream.of(1) | |
$1 ==> java.util.stream.ReferencePipeline$Head@4566e5bd |
The result is a stream containing a single element – 1
. We couldn’t do the same thing with null
though:
jshell> Stream.of(null) | |
| Warning: | |
| non-varargs call of varargs method with inexact argument type for last parameter; | |
| cast to java.lang.Object for a varargs call | |
| cast to java.lang.Object[] for a non-varargs call and to suppress this warning | |
| Stream.of(null) | |
| ^--^ | |
| java.lang.NullPointerException thrown: | |
| at Arrays.stream (Arrays.java:5610) | |
| at Stream.of (Stream.java:1187) | |
| at (#2:1) |
This makes sense – we don’t want to have null
s in our streams. Java 9, however, allows us to do the following:
jshell> Stream.ofNullable(null) | |
$3 ==> java.util.stream.ReferencePipeline$Head@72b6cbcc |
The result is not a stream containing a null
element, but an empty stream:
jshell> Stream.ofNullable(null).count() | |
$4 ==> 0 |
If you’ve worked with streams a lot, you probably immediately realize why this is an exciting feature. It makes integration better, and allows us to construct cleaner streams. Often, when using long streams with many chained operations, particularly map
s, we end up with null
s at some point in our stream. Since we don’t want to have null
s in streams, or NullPointerException
s, we need to filter the null
elements out. In practice, this involves a step with a ternary operator or an if
statement, none of which are nice. ofNullable
enables us to avoid these null
checks, which is great!
4. Collectors.filtering
and Collectors.flatMapping
Collectors are an essential part of the Stream API. They provide reduction operations which process stream elements into result containers, optionally performing a transformation on the aggregated results, after all input elements have been processed.
Java 9 added 2 new collectors, filtering
and flatMapping
, which play nice with the groupingBy
collector.
Collectors.filtering
Similarly to the filter
method on streams, the filtering
collector is used for filtering elements in a stream. The filter
method, however, processes the values before they’re grouped, whereas Collectors.filtering
can be used nicely with Collectors.groupingBy
to group the values before the filtering step takes place.
Let’s demonstrate the difference on an example. Suppose we’re given a list of numbers, e.g.:
jshell> List<Integer> numbers = List.of(2, 3, 4, 7, 9, 11) | |
numbers ==> [2, 3, 4, 7, 9, 11] |
Now suppose we need to count the numbers by parity, but we’re only interested in the numbers greater than 5. Prior to Java 9, we would accomplish this as follows:
jshell> numbers.stream().filter(j -> j > 5).collect(Collectors.groupingBy(i -> i % 2, Collectors.counting())) | |
$2 ==> {1=3} |
The result correctly says that we have 3 odd numbers greater than 5 in our list. However, it doesn’t tell us we don’t have any even numbers there. The even numbers were filtered out before aggregation, and any trace of them is lost. That’s exactly what the filtering
collector addresses. In Java 9, we can count the numbers in both bags as follows:
jshell> numbers.stream().collect(Collectors.groupingBy(i -> i % 2, Collectors.filtering(j -> j > 5, Collectors.counting()))) | |
$3 ==> {0=0, 1=3} |
As we can see now, there are clearly no even and 3 odd numbers matching our condition.
Collectors.flatMapping
Collectors.flatMapping
is to Collectors.mapping
what Stream.flatmap
is to Stream.map
. Like the flatmap
operation, the flatMapping
collector allows us to handle nested collections (collections in streams) better, in this case on the collectors’ side. Like Collectors.mapping
, Collectors.flatMapping
takes a function to be applied to the input elements, and a collector to accumulate the elements passed through the function. Unlike Collectors.mapping
, however, Collectors.flatMapping
deals with a stream of elements, which allows us to get rid of often unnecessary intermediary collections.
Consider a stream of some data entries, for example integers associated with collections of strings, such as the following:
jshell> Stream<Map.Entry<Integer,Set<String>>> entries = Stream.of(Map.entry(1, Set.of("a", "b")), Map.entry(1, Set.of("a", "c")), Map.entry(2, Set.of("d"))) | |
entries ==> java.util.stream.ReferencePipeline$Head@69379752 |
Now suppose we want to aggregate this data, e.g. group the strings by the integers they’re associated with. Using Java 8 constructs, we would accomplish this as follows:
jshell> entries.collect(Collectors.groupingBy(e -> e.getKey(), Collectors.mapping(e -> e.getValue(), Collectors.toSet()))) | |
$1 ==> {1=[[b, a], [c, a]], 2=[[d]]} |
Although this is technically correct, we end up with nested collections, which we would need to further unwrap to, for example, deal with duplicates. Fortunately, that’s exactly what the new flatMapping
collector is for:
jshell> entries.collect(Collectors.groupingBy(e -> e.getKey(), Collectors.flatMapping(e -> e.getValue().stream(), Collectors.toSet()))) | |
$2 ==> {1=[a, b, c], 2=[d]} |
To sum up, the updates to the Stream API introduced in Java 9 are not huge. They do, however, tie up some loose ends, and ultimately allow us to write cleaner code, which is always exciting!
Comments