Comp201: Principles of Object-Oriented Programming I
Spring 2008 -- Arrays and Array Processing   


There are many ways to store data. So far, we have covered linear recursive structures, lists, and binary recursive structures, trees. Let's consider another way of storing data, as a contiguous, numbered (indexed) set of data storage elements:

anArray =

itemA itemB itemC itemD itemE itemF itemG itemH itemI itemJ
0 1 2 3 4 5 6 7 8 9

This "array" of elements allows us to access any individual element using a numbered index value. That is, anArray[4] should give us itemE. Likewise, the statement anArray[7] = 42 should replace itemH with 42.

At its most basic form, an array is a random access data structure where any element in it can be accessed by specifiying a single index value corresponding to that element.

Notice however, that the above definition is not a recursive definition. This will cause problems.

Arrays in Java

More information on arrays can be found in the Java Resources web site page on arrays

Always remember: Arrays are size and speed at a price.

 

Array Types

Array Variables

Array Creation

Array Accesses

 

 

Array Processing Using Loops

More information on loops can be found at the Java Resources web site page on loops.

The main technique used to process arrays is the for loop. A for loop is a way of processing each element of the array in a sequential manner.

Here is a typical for loop:

// Sum the number of elements in an array of ints, myArray.

int sum = 0;  // initialize the sum

for(int idx=0; idx < myArray.length; idx++) {  //start idx @ 0; end idx at length-1; increment idx every time the loop is processed. 

	sum += myArray[idx];  // add the idx'th element of myArray to the sum

}  

There are a number of things going on in the above for loop:

One can traverse an array in any direction one wishes:

// Sum the number of elements in an array of ints, myArray.

int sum = 0;  // initialize the sum

for(int idx=myArray.length-1; 0<=idx; idx--) {  //start idx @ length-1; end idx at 0; decrement idx every time the loop is processed. 

	sum += myArray[idx];  // add the idx'th element of myArray to the sum

}  

The above loop sums the list just as well as the first example, but it does it from back to front. Note however, that we had to be a little careful on how we initialized the index and how we set up the termination condition.

Here's a little more complicated loop:

// Find the index of the smallest element in an array of ints, myArray.

int minIdx = 0;  // initialize the index.  Must be declared outside the loop.

if(0==myArray.length) throw new NoSuchElementException("Empty array!");  // no good if array is empty!

else {

	for(minIdx = 0, int j = 1; j<myArray.length; j++) {  //start minIdx @ 0, start index @ 1 ; end index at length-1; increment index every time the loop is processed. 

		if(myArray[minIdx] > myArray[j]) minIdx = j;  // found new minimum

	}

} 	 

Some important things to notice about this algorithm:

For convenience, Java 5.0 now offers a compact syntax used for traversing all the elements of an array or of anything that subclasses type Iterable:

MyType[] myArray;  // array is initialized with data somewhere
for(MyType x: myArray){
    // code involving x, i.e. each element in the array
}

It is important to remember that this syntax is used when one wants to process every element in an array (or an Iterable object) independent of order of processing because Java does not guarantee a traversal order. 

Let's look at an algorithm where we might not want to process the entire array:

// Find the first index of a given value in an array

int idx = -1;  // initialize the index to an invalid value.

for(int j=0; j<myArray.length; j++) {  //no initialization ; end index at length-1; increment index every time the loop is processed. 

	if(desiredValue == myArray[j]) { // found match!

		idx = j;  // save the index.

		break;  // break out of the loop.

	}

}

Notes:

There is a counterpart to break called continue that causes the loop to immediately progress to the beginning of the next iteration. It is used much more rarely than break, usually in situations where there are convoluted and deeply nested if-else statements.

Can you see that the price of the compact syntax of for loops is a clear understandability of the process?

While loops

for loops are actually a specialized version of while loops. while loops have no special provisions for initialization or loop incrementing, just the termination condition.

while loops iterate through the loop body until the termination condition evaluates to a false value.

The following for loop:


for([initialization statement]; [termination expr] ; [increment statement]) {

	[loop body]

}

Is exactly equivalent to the following:


{

	[initialization statement];

	while([termination expr]) {

		[loop body]

		[increment statement];

	}

}

Note the outermost curly braces that create the scoping boundary that encapsulates any variable declared inside the for loop.

The Java compiler will automatically convert a for loop to the above while loop.

Here is the above algorithm that finds a desired value in an array, translated from a for loop to a while loop:


// F// Find the index of the first occurance of desiredValue in myArray, using a while loop.

{

	idx = -1;  // initialize the final result

	int j = 0; // initialize the index

	while(j< myArray.length) {   // loop through the array

		if(desiredValue == myArray[j]) {   // check if found the value

			idx = j;  // save the index

			break;   // exit the loop.

		}

		j++;   // increment the index

	}

}

Basically, for loops give up some of the flexibility of a while loop in favor of a more compact syntax.

while loops are very useful when the data is not sequentially accessible via some sort of index. Another useful scenario for while loops is when the algorithm is waiting for something to happen or for a value to come inot the system from an outside (relatively) source.

do-while loops are the same as while loops except that the conditional statement is evaluated at the end of the loop body, not its beginning as in a for or while loop.

See the Java Resources web site page on loops for more information on processing lists using while loops.

for-each loops

An exceedingly common for-loop to write is the following;

Stuff[] s_array = new Stuff[n];
// fill s_array with values

for(int i = 0; i < s_array.length; i++) {
	// do something with s_array[i]
}

Essentially, the loop does some invariant processing on every element of the array.

To make life easier, Java implements the for-each loop, which is just an alternate for loop syntax:

Stuff[] s_array = new Stuff[n];
// fill s_array with values

for(Stuff s:s_array) {
	// do something with s
}

Simpler, eh?

It turns out that the for-each loop is not simply relegated to array.  Any class that implements the Iterable interface will work.   We will discuss this more at a later date, as it involves the use of generics.

Arrays vs. Lists

In no particular order...
Arrays
Lists
  • Fast access to all elements.
  • Fixed number of elements held.
  • Difficult to insert elements.
  • Can run into problems with unitialized elements.
  • Minimal safety for out-of-bounds indices
  • Minimal memory used.
  • Simple syntax.
  • Must use procedural techniques for processing
  • Often incompatible with OO architectures.
  • Difficult to prove that processing algorithms are correct.
  • Processing algorithms can be very fast.
  • Processing algorithms can be minimally memory intensive.
  • Slow access except to first element, which is fast.
  • Unlimited number of elements held.
  • Easy to insert elements.
  • Encountering unitialized elements very rare to impossible.
  • Impossible to make out-of-bounds errors
  • Not optimized for memory usage.
  • More cumbersome syntax.
  • Can use OO and polymorphic recursive techniques for processing.
  • Very compatible with OO architectures.
  • Easy to prove that processing algorithms are correct.
  • Processing algorithms can be quite fast if tail-recursive and using a tail-call optimizing compiler.
  • Processing algorithms can be very memory intensive unless tail-recursive and using a tail-call optimizing compiler.

Bottom line: Arrays are optimized for size and random access speed at the expense of OO design and recursion. If you do not need speed or low memory, do not use an array. If you must use an array, tightly encapsulate it so that it does not affect the rest of your system.

 

 


Last Revised Thursday, 03-Jun-2010 09:50:30 CDT

©2008 Stephen Wong and Dung Nguyen