Ruby (programming language)

From Citizendium
Revision as of 13:15, 11 October 2007 by imported>Pat Palmer (→‎Strings and regular expressions)
Jump to navigation Jump to search
Main Article
Talk Template:Default button 3
 
Template:Cell style

Ruby is a general-purpose computer programming language made available as an open-source project in 1995 by its creator, Yukihiro Matsumoto (commonly called Matz), a Japanese computer professional with experience in open-source software and familiarity with a wide variety of programming languages. Matz has closely managed Ruby releases in the years since it first appeared, and by 2007, Ruby has been ranked tenth in usage among all programming languages. Furthermore, its use is still growing[1].

In 2004, Ruby's place in the U.S. computer industry was boosted by the independent release of Rails, a Ruby-based, open-source web application framework created in the United States by David Heinemeier Hansson, a Danish developer. This article is an overview of several often-cited Ruby characteristics, independent of Rails, for which the language is sometimes both praised and criticized.

Ruby Implementations

Like many "newer" programming languages (meaning those created or updated since 1990), Ruby is fully object-oriented and requires the installation of a runtime environment, or virtual machine before Ruby programs can be developed or run.

As of October 2007, ever-improving versions of the official open-source Ruby implementation have been released on several different operating systems. These runtimes are interpreted, rather than compiled, and thus Ruby is not yet considered to be a high-performance platform suitable for some heavy-load, enterprise applications. Ruby implementations have also been developed by other groups, including JRuby (an attempt to port Ruby to the Java platform), and Rubinius (an interpreter modeled after self-hosting Smalltalk virtual machines).

As of 2007, no formal written specification has been provided for validating Ruby implementations. So although Ruby can potentially can be used to create platform-independent programs, Ruby is not currently guaranteed to be identical across platforms, and newer versions are not always upwardly compatible with older versions. Furthermore, a burgeoning number of books, articles and other documentation are not always in complete agreement about the syntax, semantics, and conventions of the language. There is widespread agreement that Ruby would benefit from having a formal specification.

Things people like (and hate) about Ruby

Despite performance and cross-version and compatibility concerns, enthusiasts of Ruby wax eloquent in praising the language, including numerous subjective statements such as "it's fun". Something of Ruby's appeal may be seen in the brevity of this Hello World program:

puts "Hello, world"

But simple as it initially may seem, Ruby is described as having hidden depths, largely as a result of its support for a complex and powerful feature called closures. Peter Cooper, author of a 2007 book about Ruby, introduces the language by stating, "Ruby has more in common with more esoteric languages such as Lisp and Smalltalk than with better known languages such as PHP and C++"[2]. Cooper's book, and numerous other sources, list several characteristics of Ruby that may allow programs to be written with more ease, speed and "joy", than with other languages, including:

  1. closures
  2. a relatively permissive syntax, said to be more like the way people think and talk
  3. loose typing
  4. good string handling and regular expressions
  5. extensive libraries for networking and web services
  6. powerful support for making calls out to the native operating system if needful

Closures

Closures are a powerful and complex feature, implemented in their most flexible form in only a few programming languages such as Smalltalk. A full discussion of closures deserves an article of its own but cannot be avoided in any serious discussion of Ruby, which encourages their widespread use. The definition and importance of closures has been widely debated with all the politeness for which the computer industry is known. The learning curve for closures appears to be steep enough to cause some dissonance with the common claim that "Ruby makes programming easier".

A closure occurs when a procedure (or a so-called block--an unnamed procedure) is physically situated inside another procedure, and the inner procedure can be referenced (called) from outside of the enclosing procedure. In some languages, the inner procedure can only read the variables in the enclosing scope. Closures become really powerful when the inner procedure can be referenced (called) from anywhere else in the code, not just from within the enclosing procedure. Languages such as C, C++, Java or C#, which implement local variables using stack frames, mostly do not allow procedures to exist inside other procedures. So-called inner classes in Java and C# are closure-like but are mainly restricted to use as event handlers for the enclosing class. Ruby allows and encourages unrestricted use of closures.

Knowing when and how to make use of a true closure--what it can be good for--is neither obvious nor simple to many developers. Ruby advocates often struggle to illustrate the potential power of closures, which formerly were not required learning for a majority of programmers (to whom they were usually not available anyway). The often-cited example of closure use in Ruby is the .each procedure, which provides a more convenient way of iterating through collections than conventional looping. But sceptics counter that even Java now provides its own very simple syntax for iterating through collections (i.e., the "enhanced for loop").

A warning from the seminal 1995 Gang of Four Design Patterns book may be relevant to this discussion: ""Dynamic, highly parameterized software is harder to understand than more static software."[3]

Even without the full power of Ruby-style closures, allowing scopes within scopes, which is at least part of what a closure does, may be seen as a hazard for unaware programmers. In a deceptively simple-seeming language such as Javascript, for example, it is not uncommon for programmers to use a variable defined in an outer scope without realizing the consequences of having done so (i.e., the variable will not be started over from scratch each time the inner procedure is called, but will act like a global instead). This is a common source of bugs in so-called Ajax applications if there is a need to have multiple Ajax (Javascript) calls going on between a web browser and web server simultaneously.

The fact that unaware programmers may hang themselves due to the complexity of a feature is not necessarily a reason to withhold that power from a programmer. Advocates claim the added power and elegance of design is worth added difficulty that results when trying to read somebody else's code that uses closures.

Permissive syntax

Ruby tries to be tolerant of syntax styles from several different programming languages, and this makes it easier for programmers to migrate from those languages into Ruby. The downside of this is that a programmer who is used to only one form of the syntax will now need to become familiar with both (or multiple) syntactic styles.

Loose typing

Like Visual Basic's default behavior, Ruby tries to infer the type of variables from the context, freeing the programmer (in many cases) from needing to declare the type explicitly upon first use. This is the opposite of what Java and C# do (they use so-called strong typing, which requires every variable's type to be known before the variable can be used).

Sometimes Ruby is described as being a "dynamic" language, which is often referring to the fact that programmers are not required to explicitly declare variables, and the Ruby runtime generally makes a pretty good guess as to how a variable should be "typed" at runtime.

Sometimes Ruby is described as being a "scripting" language, which is often referring to the need for a runtime (interpreter), as well as, possibly, the lack of strong typing.

Strings and regular expressions

Ruby has a lot of useful, built-in libraries for strings and symbols (which are like "interned" strings in Java or C#). Ruby also has special syntax allowing the use of "hashes", which are similar to keyword-based collections in Java or C#.

Networking, including web services

Calling into the OS

This feature may be regarded as a double-edged sword. On the one hand, it's very helpful to be able to take advantage of operating system facilities to expand the power of the Ruby language. On the other hand, doing so makes code less likely to be portable to another operating system.

On a historic note, Pascal (a once very promising programming language) may have failed to achieve immortality large due the inability of programmers to make calls to the underlying operating system without great difficulty. The designers of Ruby (and other programming languages created after Pascal) have taken this "object lesson" to heart and provided facilities to empower calls into the OS if needed. It can reasonably be argued that Ruby makes this easier than most other languages at present.

References

  1. "TIOBE Programming Community Index". TIOBE Software (2007). Retrieved on 2007-10-10.
  2. "Beginning Ruby: From Novice to Professional". Apress paperback book, Introduction p. xxix (2007). Retrieved on 2007-10-10.
  3. Design Patterns: Elements of Reusable Object-Oriented Software (page 21). Addison-Wesley (2007). Retrieved on 2007-05-24.