Interesting files
June 15, 2006
Ubuntu dapper release time. I spent several hours tonight downloading the
about to be official boot images and running through a full re-install. Things
went through without issue. Sweet. I look forward to really playing with
linux on this hardware.. and being able to compare it to solaris.
June 13, 2006
Over the last several days/nights, David and Fabbione have been busy little beavers.
They've been trying numberous new kernels and doing some binary searching to narrow
down the underlying cause. With the release deadline for Ubuntu Dapper looming
scarily close, it looks like a 'good enough' work around for the problems booting
the linux/non-smp kernel has been found. Work was made considerably easier when
David Miller was able to finally reproduce the problem on his own box once some of
the triggering events were narrowed down on my box. There's apparantly a tiny
block of memory just shy of the 8G mark that tickles some issue that's yet to be
well explained. However, by ignoring any region of memory smaller than 256Kb the
issue goes away. That's the fix that dapper is going to ship with until a better
fix can be worked out. For those interested,
the change.
With this fix in place, my box is, for the first time, happy with a standard
boot.img file.
June 6, 2006
Setup access for fabbione to be able to upload images to my tftp server and work
remotely while I'm asleep. The fun of living on opposite sides of the planet.
Technology is great, isn't it?
May 30, 2006
Spent a good bit of time updating gdc to the recently released 0.18 and re-testing
both some local patches and the original source. The t2000 shined here once again.
Test runs in the 2 hour range. Filed a few bugs, but generally a pretty solid
release and much needed, syncing up with modern versions of dmd. Subsequent commits
to the now public svn repository bring it up to fully current with regard to dmd.
In anticipation of having to return the box soon, sadly, I need to start feeding
patches back to dvdf. The t2000 hardware is certainly very nice to work with, but
I can't afford the over $8k it'd cost to keep it. If I could, it'd be tempting.
May 26, 2006
Whoo hoo.. the install made big progress while I snoozed. I stayed home longer than
I should have to finish the install prompts and headed to work quite late, but with
a nice working bootable t2000/linux box.
Time passes and work is over for the day (at least mostly..).
After playing around for a bit with the uni-processor kernel, seeing that the box
seemed stable, fabbione suggested booting into the stock -smp kernel that was
supplied by the install process. I had doubts, but shouldn't have. As usual, he
was right.. it booted, saw all 16 processors, and is generally mostly happy.
Now, the fun.. some box stressing to see how it holds up. I threw a 'make -j' at
it. Full parallelism, no holds barred. It oopsed. Then it hit another kernel bug
that's apparantly already known dealing with SLAB and fd releasing.
The testing continues....
May 25, 2006
Well, with my trial period nearing the end, and David Miller being swamped, it was
time to take things into my own hands. I almost did this back a month or so ago, but
at the time it seemed like David was gonna be available and his experience and skill
with sun hardware and kernel issues so vastly trumps mine, that I didn't bother.
I don't want to loose a known broken config since that'd make debugging the problem
essentially impossible. So, the choice, dig in!
I hopped back on to #ubuntu-ports and got some hand-holding from fabbione, which has
made things _so_ much easier. He was doing builds with varous kernel debug options
and I was booting, and rebooting, and rebooting.. all trying to descern a pattern.
Some boots would work, some wouldn't. We finally got through into user space and hit
some known kernel module abi issues. So, fabbione built one more boot.img that had
a bunch more builtin modules, enough to get through the install. The download time
is huge.. so... sleep.
April 26, 2006
Been too long since I updated this log of my status. I'll fill in some past data this weekend
but for now, a quick update. Feelings are mixed. As advertised, the box is pretty decent
when all cores and threads are kept fed with work. My test workload is two fold:
Both tend to keep most of the cpu's busy much of the time, but both are also somewhat disk
intensive. I'm fairly sure that I'm hitting disk io limits which is keeping me from keeping
all the cpu's busy. For compilations, my old Athlon 1700 outpaces the T2000. Numbers will
be posted this weekend after I can run some more scientific tests (ok, so I can actually
write down the numbers). One set of numbers I can share right now are dstress run times.
On the Athlon 1700, with no attempt at parallelization, a full run would take 8-10 hours.
On the T2000 with 'make -j32' (with 16 cores), a full run takes approximately 3-3.5 hours.
Now, an important caveat, a whole lot more tests fail on solaris/sparc than on linux/x86
so the numbers aren't 100% comparable. But regardless, over twice as fast is very nice.
More soon.
April 9, 2006
One of the reasons I felt confident with this try/buy program is that David S. Miller had
already gotten the linux kernel booting and a full, normal, Ubuntu Linux installed and running
on his box. I assumed that I'd be able to do the same and work with an os I was comfortable
with. So, I set about reading up on network booting and locating the boot image. I finally
got all the pieces together after much reading and some rapid learning about rarpd and tftp.
Shortly after the elation of that success, I hit a wall, the kernel wouldn't make it through
boot (
log).
I proceeded to subscribe to the sparclinux mailing list. After checking the archives I didn't
see any traffic about T2000 boot issues, so time for a
new thread.
Luckily both David (long time sparc kernel expert) and Fabio (Ubuntu kernel guy and also a
sparc guy) were both up and quickly engaged on the problem. After hopping on irc to short-circuit
the communication lag of email, we tred a few things: an older image, some command line args,
comparing hardware (prtconf output) between the three of us. Nothing
obviously different except for cpu counts.
The working theory before we break for the night is that it's either buggy hardware, though
solaris doesn't seem to have any problems, or a bug that's triggered with 16 cpus but not 24
or 32 which is what they have. Unfortunatly David is going to be busy for a couple days
but will get back to this probably wednesday. During some of the testing, I did get past the
above error and got a different one that doesn't halt the boot process.
The Ubuntu install process starts, but stalls after the second screen, almost certainly due
to the underlying problem causing both the boot failure and the oops. More later this week I
expect.
If you're paying way too much attention, then you might have noticed the mention of 16 cpu's
in the above paragraph and that I'd asked for the 8 core model. Yup, I was shipped the wrong
configuration, 4 cores instead of 8. If the problem with booting the linux kernel is related
to # of cpu's, it's somewhat serendipitous, since catching the bug is good. However, it's
half the cpu's I want for my testing. Mail has been sent to the guys at Sun. I'd like to
get the right config, but until this linux boot issue is resolved, I don't want to give up
the currently breaking configuration either.
April 8, 2006
Sorrow and frustration ensues. There's a required cable that wasn't included in the shipment.
It's the SC serial management cable. It's how you actually get into the box to perform the
initial setup. You only need it once after which you can use ethernet to get to both the
management port and the standard login port. I don't have one laying around. What I do have,
though, is spare network cables and serial cables and can cobble one together. That done I
can successfully, for the first time, connect and figure out how to boot the machine.
Word to the wise, including some clueless newbie instructions on the website or with the
hardware would be invaluable. A side by side comparison with linux would do wonders. I
haven't used solaris as a user in a decade and never really as an administrator. After getting
the initial boot and setup of solaris done, I feel rather lost. Few of the normal admin
tools I've come to expect under linux are present. I know I can download a lot from the net
and will, but it's like a deep form of culture shock.
April 7, 2006
Delivery! Alas, work comes first, so play time will have to wait.
April 5, 2006
Lacking a contact with the shipper, I make calls to sun to get the status of my shipment and
try to track down the shipper. After a few attempts, apparantly the DHL they use and the DHL
the rest of the world thinks of as DHL are two different entities. I finally talk with the
dispatcher for the right DHL and arrange for a friday delivery.
April 4, 2006
Waiting at home all day didn't accomplish anything. No calls from the shipper. No visits from
the shipper. I'm lost and anxious to get down to some serious progress.
April 3, 2006
The shippers attempted to deliver a pallet to me but not being home, they didn't succeed. The
phone number they left in voice mail is incorrect, grf. I'll work from home tomorrow to
receive the shipment.
March 24, 2006
Sweet, I got an email today about the box being shipped. The 10 day countdown has started.
March 17, 2006
My credit limit has been raised, yay. A quick mail sent off to the gentleman I've been talking
to have him call me back.
March 15, 2006
I talk with another gentleman on the phone for a bit. Apparantly there are no options for
individuals other than authing a credit card. For companies, however, the doors are much wider.
As long as their finance team determines that the company is worth or worth the risk than
chances are very good the application would sail through. Choices: up my credit limit, form
a company, find one willing to back me. I opt for #1, as the most appropriate and on the
up and up.
March 14, 2006
A major wrinkle forms in this little saga. Apparantly before Sun can/will ship me the trial
hardware, I have to be able to provide them with a credit card they can auth for the cost of
the box. This is a problem as I've never needed a credit limit this high. I ask them to call
me to see what the options are.
March 9, 2006
Worried that I haven't heard from anyone in several days, I send off a ping to the gentleman
I've been exchanging mails with. He assures me that it'll ship soon and I should have it
about 10 days later.
Success, a flurry of emails back and forth with a couple hardware config quotes and I
picked the medium model: Medium: 8 cores, 16 x 512MB (T20-108A-08GA2C) -- US$13,395.
Pricy compared to the x86 hardware I'm used to dealing with, but it's a trial, so roll
with it.
March 6, 2006
Attempt #2: After several days with no response, I decide it's time to try again. Still no
company, but I said I had a product, as I was highly confident that gdc could be ported to
run on sparc/linux and sparc/solaris without too much trouble, certainly within the 60 day
period leaving time to take real advantage of the hardware.
February 23, 2006
Well, I got wind (thanks
slashdot)
of a wonderful
promotion
that sun is running for their new
T2000
hardware line. It specializes and seems to be specifically targeted at the large niche
of parallel tasks such as web servers or software compilation. I happened to have
recently become involved with The D Programming Language, specifically the compiler itself.
Building the compiler doesn't take super long, but running the test suite does (approximately
12 hours on my fastest box, an athlon 1700 w/ 3 gigs ram). Both are tasks, being perfectly
suited for this hardware so I just had to sign up for the try and buy program.