The latest Nextflow version 2020.10.0 is the first stable release running on Groovy 3.
The first benefit of this change is that now Nextflow can be compiled and run on any modern Java virtual machine, from Java 8, all the way up to the latest Java 15!
Along with this, the new Groovy runtime brings a whole lot of syntax enhancements that can be useful in the everyday life of pipeline developers. Let’s see them more in detail.
The !
(not) operator can now prefix the in
and instanceof
keywords.
This makes for more concise writing of some conditional expression, for example, the following snippet:
list = [10,20,30]
if( !(x in list) ) {
// ..
}
else if( !(x instanceof String) ) {
// ..
}
could be replaced by the following:
list = [10,20,30]
if( x !in list ) {
// ..
}
else if( x !instanceof String ) {
// ..
}
Again, this is a small syntax change which makes the code a little more readable.
The elvis assignment operator ?=
allows the assignment of a value only if it was not
previously assigned (or if it evaluates to null
). Consider the following example:
def opts = [foo: 1]
opts.foo ?= 10
opts.bar ?= 20
assert opts.foo == 1
assert opts.bar == 20
In this snippet, the assignment opts.foo ?= 10
would be ignored because the dictionary opts
already
contains a value for the foo
attribute, while it is now assigned as expected.
In other words this is a shortcut for the following idiom:
if( some_variable != null ) {
some_variable = 'Hello'
}
If you are wondering why it’s called Elvis assignment, well it’s simple, because there’s also the Elvis operator that you should know (and use!) already. 😆
Groovy 3 supports the syntax for Java lambda expression. If you don’t know what a Java lambda expression is don’t worry; it’s a concept very similar to a Groovy closure, though with slight differences both in the syntax and the semantic. In a few words, a Groovy closure can modify a variable in the outside scope, while a Java lambda cannot.
In terms of syntax, a Groovy closure is defined as:
{ it -> SOME_EXPRESSION_HERE }
While Java lambda expression looks like:
it -> { SOME_EXPRESSION_HERE }
which can be simplified to the following form when the expression is a single statement:
it -> SOME_EXPRESSION_HERE
The good news is that the two syntaxes are interoperable in many cases and we can use the lambda syntax to get rid-off of the curly bracket parentheses used by the Groovy notation to make our Nextflow script more readable.
For example, the following Nextflow idiom:
Channel
.of( 1,2,3 )
.map { it * it +1 }
.view { "the value is $it" }
Can be rewritten using the lambda syntax as:
Channel
.of( 1,2,3 )
.map( it -> it * it +1 )
.view( it -> "the value is $it" )
It is a bit more consistent. Note however that the it ->
implicit argument is now mandatory (while when using the closure syntax it could be omitted). Also, when the operator argument is not single value, the lambda requires the
round parentheses to define the argument e.g.
Channel
.of( 1,2,3 )
.map( it -> tuple(it * it, it+1) )
.view( (a,b) -> "the values are $a and $b" )
Since version 8, Java provides a stream library that is very powerful and implements some concepts and operators similar to Nextflow channels.
The main differences between the two are that Nextflow channels and the corresponding operators are non-blocking i.e. their evaluation is performed asynchronously without blocking your program execution, while Java streams are executed in a synchronous manner (at least by default).
A Java stream looks like the following:
assert (1..10).stream()
.filter(e -> e % 2 == 0)
.map(e -> e * 2)
.toList() == [4, 8, 12, 16, 20]
Note, in the above example filter, map and toList methods are Java stream operator not the Nextflow homonymous ones.
The new runtime also allows for the use of the ::
operator to reference an object method.
This can be useful to pass a method as an argument to a Nextflow operator in a similar
manner to how it was already possible using a closure. For example:
Channel
.of( 'a', 'b', 'c')
.view( String::toUpperCase )
The above prints:
A
B
C
Because to view operator applied the method toUpperCase to each element emitted by the channel.
The new Groovy runtime brings a lot of syntax sugar for Nextflow pipelines and allows the use of modern Java runtime which delivers better performance and resource usage.
The ones listed above are only a small selection which may be useful to everyday Nextflow developers. If you are curious to learn more about all the changes in the new Groovy parser you can find more details in this link.
Finally, a big thanks to the Groovy community for their significant efforts in developing and maintaining this great programming environment.