Skip to main content

Shell Sort Notes

I don't understand this passage.  What are the two alternatives? (Algorithms, 258)
One way to implement shellsort would be, for each h, to use insertion sort indepen-
dently on each of the h subsequences. Because the subsequences are independent, we
can use an even simpler approach: when h-sorting the array, we insert each item among
the previous items in its h-subsequence by exchanging it with those that have larger
keys (moving them each one position to the right in the subsequence).
Have to research different implementations of Shell Sort.  Would also help to clarify the mistake I made in my discussion yesterday.

Having said that, I can think of two possibilities.  Given, for instance, h of 3 and a sequence:

1 2 3 4 5 6 7 8 9

-- two methods would be first:

Compare and exchange 1 and 4...
...2 and 5
...3 and 6
...4 and 7
...5 and 8
...6 and 9

-- or second:

Do an insertion sort on 1, 4, and 7
Do an insertion sort on 2, 5, and 8
Do an insertion sort on 3, 6, and 9

Is that what the authors are getting at?  What is the difference between "using insertion sort independently on each of the h subsequences" and "inserting each item among the previous items in its h subsequence"?

Maybe the first method looks like this:

For each element in the array, perform an insertion sort that increments by h (instead of 1 -- why Shell Sort is a generalization of insertion sort).  Then go to the next h in the increment sequence.

And the second method looks like this:

Iterate through the array.  (Because we'll be looking h elements to the left of each item, we can start at h.)  For each item, exchange it with the element h to its left if the left element is less until you reach the beginning of the h-sequence to which it belongs (i.e. and index less than or equal to h).

The second method I know is just a refined description of the implementation I discussed yesterday (cribbed from memory).  So I just have to make sure that the first method really is different from the second.  Well, in some sense, the two must be equivalent -- but they should be different operationally.  So how are they different?

Because the first method goes through every element in one h-sequence and puts it into place before moving to the next h-sequence.  The second, on the other hand, goes element by element and organizes each of the h-sequences as it goes.  It must be like finishing five braids either by performing all the first's weaves, then all the second's weaves, in order, or by performing the first weave for the first, the first weave for the second, and so on -- in order, but a different order.

Comments

Post a Comment

Popular posts from this blog

Getting Geodata From Google's API

The apps I'm going to be analyzing are part of Dr. Charles Severance's MOOC on Python and Databases and work together according to the following structure (which applies both in this specific case and more generally to any application that creates and interprets a database using online data). The data source, in this case, is Google's Google Maps Geocoding API.  The "package" has two components: geoload.py  and geodump.py .  geoload.py  reads a list of locations from a file -- addresses for which we would like geographical information -- requests information about them from Google, and stores the information on a database ( geodata.db ).  geodump.py  reads and parses data from the database in JSON, then loads that into a javascript file.  The javascript is then used to create a web page on which the data is visualized as a series of points on the world-map.  Dr. Severance's course focuses on Python, so I'm only going to work my way through ...

Shell Sort

Today I spent a little bit of time researching the "Shell" sort.  I wanted to post a few notes about the Princeton Algorithms Course's implementation to help me solidify my understanding. First, a little tidbit.  When I first heard about this algorithm, I thought it had something to do with shell games.  Turns out a man named Donald Shell discovered this method of sorting, whence the name. The Algorithms  book gives the following explanation (Sedgewick and Wayne,  Algorithms, 4th ed., p. 258): The idea is to rearrange the array to give it the property that taking every hth entry (starting anywhere) yields a sorted subsequence. Such an array is said to be h-sorted. Put another way, an h-sorted array is h independent sorted subsequences, interleaved together. By h-sorting for some large values of h, we can move items in the array long distances and thus make it easier to h-sort for smaller values of h. Using such a procedure for any sequence of values o...

It's a Date

I guess I should really be putting these things up in GitHub.  The way I see it, the coding journal is just a place to share the code I write or study along with any notes I have about it.  It's sort of a documentation LiveJournal, if you will. Anyway, this is a "study" for my project idea: create an app that will prompt the user for two dates, then calculate the difference between them. The burden of this study is twofold: (1) convert dates in standard American form (e.g. December 15, 1993) into dates in standard American numeric form (e.g. 12/15/1993); (2) create a numerical representation of the date. To process the date, I started with a list of the months.  I then used a loop to create a dictionary that would attach a value to each month. Next I had to parse the user entry (I haven't added any debugging for incorrect entries yet). I did so by splitting the entry into "raw" data.  I used my dictionary to process the month name into a number, str...