Skip to main content

Shell Sort Notes

I don't understand this passage.  What are the two alternatives? (Algorithms, 258)
One way to implement shellsort would be, for each h, to use insertion sort indepen-
dently on each of the h subsequences. Because the subsequences are independent, we
can use an even simpler approach: when h-sorting the array, we insert each item among
the previous items in its h-subsequence by exchanging it with those that have larger
keys (moving them each one position to the right in the subsequence).
Have to research different implementations of Shell Sort.  Would also help to clarify the mistake I made in my discussion yesterday.

Having said that, I can think of two possibilities.  Given, for instance, h of 3 and a sequence:

1 2 3 4 5 6 7 8 9

-- two methods would be first:

Compare and exchange 1 and 4...
...2 and 5
...3 and 6
...4 and 7
...5 and 8
...6 and 9

-- or second:

Do an insertion sort on 1, 4, and 7
Do an insertion sort on 2, 5, and 8
Do an insertion sort on 3, 6, and 9

Is that what the authors are getting at?  What is the difference between "using insertion sort independently on each of the h subsequences" and "inserting each item among the previous items in its h subsequence"?

Maybe the first method looks like this:

For each element in the array, perform an insertion sort that increments by h (instead of 1 -- why Shell Sort is a generalization of insertion sort).  Then go to the next h in the increment sequence.

And the second method looks like this:

Iterate through the array.  (Because we'll be looking h elements to the left of each item, we can start at h.)  For each item, exchange it with the element h to its left if the left element is less until you reach the beginning of the h-sequence to which it belongs (i.e. and index less than or equal to h).

The second method I know is just a refined description of the implementation I discussed yesterday (cribbed from memory).  So I just have to make sure that the first method really is different from the second.  Well, in some sense, the two must be equivalent -- but they should be different operationally.  So how are they different?

Because the first method goes through every element in one h-sequence and puts it into place before moving to the next h-sequence.  The second, on the other hand, goes element by element and organizes each of the h-sequences as it goes.  It must be like finishing five braids either by performing all the first's weaves, then all the second's weaves, in order, or by performing the first weave for the first, the first weave for the second, and so on -- in order, but a different order.

Comments

Post a Comment

Popular posts from this blog

Getting Geodata From Google's API

The apps I'm going to be analyzing are part of Dr. Charles Severance's MOOC on Python and Databases and work together according to the following structure (which applies both in this specific case and more generally to any application that creates and interprets a database using online data). The data source, in this case, is Google's Google Maps Geocoding API.  The "package" has two components: geoload.py  and geodump.py .  geoload.py  reads a list of locations from a file -- addresses for which we would like geographical information -- requests information about them from Google, and stores the information on a database ( geodata.db ).  geodump.py  reads and parses data from the database in JSON, then loads that into a javascript file.  The javascript is then used to create a web page on which the data is visualized as a series of points on the world-map.  Dr. Severance's course focuses on Python, so I'm only going to work my way through ...

The Jump Algorithm

Meetup Went to a Coding Whiteboard Meetup tonight.  It was pretty great.  One of the leaders was even a CS master's student.  At first, honestly, I felt a little bit frustrated, especially because everyone around me seemed to be using pretty high level concepts / approaches that I wasn't familiar with.  But I found someone and relentlessly talked him through his approach until we both kind of realized there were issues in the problem we hadn't worked out yet.  I guess it just reinforces my feeling that when something seems too difficult, if you can, you need to find someone and force him to explain it to you in terms you can understand.  If the people around you really understand what they're about, they will have no problem and you'll learn a lot (assuming they're patient, I guess).  If they don't, you'll realize you aren't as alone as you thought you were.  Bit of the old Socrates. Problem So imagine you have an array with a bunch of numbe...

Shell Sort

Today I spent a little bit of time researching the "Shell" sort.  I wanted to post a few notes about the Princeton Algorithms Course's implementation to help me solidify my understanding. First, a little tidbit.  When I first heard about this algorithm, I thought it had something to do with shell games.  Turns out a man named Donald Shell discovered this method of sorting, whence the name. The Algorithms  book gives the following explanation (Sedgewick and Wayne,  Algorithms, 4th ed., p. 258): The idea is to rearrange the array to give it the property that taking every hth entry (starting anywhere) yields a sorted subsequence. Such an array is said to be h-sorted. Put another way, an h-sorted array is h independent sorted subsequences, interleaved together. By h-sorting for some large values of h, we can move items in the array long distances and thus make it easier to h-sort for smaller values of h. Using such a procedure for any sequence of values o...