PARLANSE. See
http://www.semdesigns.com/Products/Parlanse/index.html
This language runs on SMP x86 systems with 1-32 CPUs.
It offers "fine grain" parallelism based primarily
on static partial orders and a compiler that synthesizes
as much of the scheduling code for a grain switch as
it can, to keep overhead low.
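
To make "partial order" concrete, here is a rough sketch in Python, not PARLANSE syntax. The names (run_partial_order, grains, before) are made up for illustration. It declares which grains must finish before others start and runs everything else concurrently. Note that this sketch resolves the ordering dynamically at run time, whereas the point of PARLANSE as described above is that the compiler works out as much of that scheduling as it can ahead of time.

```python
from concurrent.futures import ThreadPoolExecutor, wait, FIRST_COMPLETED

def run_partial_order(grains, before, workers=4):
    """Run each grain once; for every (a, b) in `before`, a finishes before b starts.

    grains: dict mapping a name to a zero-argument callable (hypothetical shape).
    before: iterable of (earlier, later) name pairs describing the partial order.
    """
    preds = {name: set() for name in grains}
    succs = {name: set() for name in grains}
    for a, b in before:
        preds[b].add(a)
        succs[a].add(b)

    finished = []   # names in the order they completed
    running = {}    # future -> grain name
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Grains with no predecessors can all start immediately.
        for name in grains:
            if not preds[name]:
                running[pool.submit(grains[name])] = name
        while running:
            done, _ = wait(list(running), return_when=FIRST_COMPLETED)
            for fut in done:
                name = running.pop(fut)
                fut.result()  # re-raise any exception from the grain
                finished.append(name)
                # Release successors whose predecessors have all completed.
                for nxt in succs[name]:
                    preds[nxt].discard(name)
                    if not preds[nxt]:
                        running[pool.submit(grains[nxt])] = nxt
    if len(finished) != len(grains):
        raise ValueError("ordering constraints contain a cycle")
    return finished

if __name__ == "__main__":
    work = {n: (lambda n=n: print("grain", n)) for n in "abcd"}
    # a and b are unordered relative to each other; c needs both; d needs c.
    run_partial_order(work, [("a", "c"), ("b", "c"), ("c", "d")])
```

Here a and b may run in either order or at the same time, which is exactly the freedom a partial order leaves to the scheduler.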