Free Newsletters
Technology & Business Daily

InfoWorld
Log-in | Register

  Monday, April 11, 2005 

Bob Glushko and Roy Fielding on enterprise information integration

I'm in a session on enterprise information integration. First up is Bob Glushko, who's talking about document engineering. He's writing a book, it's late, and here's why. Even though he used a comprehensive Microsoft Word stylesheet to ensure maximal control over the content, a lot of stuff didn't survive the transition to Quark. Crazy, huh? Here's a guy who's capable of writing directly in XML and transforming to HTML -- that's how he built the slideshow he's showing us -- and his book is stuck in a publishing bottleneck. In the short run this may be a Bob Glushko problem, but really, it's an MIT Press and Quark problem.

The gist of the talk is that data and documents live along a continuum, but we approach them from two different perspectives. We tend to deal with data in a top-down process-oriented way, and with documents in a bottom-up user-oriented way, and these traditions have got to meet in the middle. I violently agree.

Next up is Roy Fielding, chief scientist with Day Software where, he says, he's working to recapitulate principles of web architecture in the realm of enterprise applications. The problem: siloed applications that manage their own content repositories and present their own APIs. The proposed solution: JSR 170, which specifies a standard API that Java apps can use to access diverse APIs and repositories. The incubator for the reference implementation is called Apache Jackrabbit.

There's skepticism in the room. Universal repository initiatives have come and gone, people are saying, and the benefits are clear, but will infrastructure vendors -- Oracle being the most obvious litmus test -- really get behind this one? Fielding's bet is that they will, in part because of the open source nature of the initiative.

Not every content management system runs on the Java Virtual Machine, of course. What about non-Java systems? Another way into the API is WebDAV which, Fielding says, maps nicely to JSR 170.

 

Meme tracking with Greasemonkey

A couple of months ago, I charted the flow of the ACLU Pizza movie through the blogosphere, using data from Bloglines and del.icio.us. When I visit that blog entry today, I see this notation in the upper left corner of the page:

bloglines: 11 delicious: 2
In other words, the page has been cited 11 times on Bloglines and twice on del.icio.us; the links go to the details.

This tracking data is inserted by a Greasemonkey script which is a straightforward extension of two bookmarklets (del.icio.us, bloglines) that I've been using for quite a while now.

I toyed with the notion of presenting the data as sparklines (bloglines: 29 delicious: 78) -- Edward Tufte's "intense, simple, word-sized graphics" -- but decided that would require an unreasonable amount of page-fetching and HTML-reverse-engineering.

So for now it's just raw counts, and they're fascinating. Although the script I've posted runs only on InfoWorld pages, I have to admit that the version I'm using runs on every page I read, creating a realtime display that I find much more useful than what the generic toolbars (Alexa, Google) offer.

Although it's pretty straightforward to write these Greasemonkey scripts, there are two aspects of the job that feel antiquated. One is groveling around inside Web pages -- in this case, the Bloglines and del.icio.us citation pages -- using regular expressions. The other is groveling around inside the DOM (document object model) of the page into which you're inserting instrumentation.

I think there's some technology (bloglines: 368 delicious: 23) floating around that could help here.

Update: Erik Kastner notes that the Greasemonkey restriction on receiving XML is a security precaution. Simon Willison's excellent questions about etiquette and privacy have convinced me to throttle the script back to just InfoWorld pages for now.

Greasemonkey raises tough issues. It's clearly something of enormous value that we should, as Simon suggests, handle with caution.

 


Recent Entries


















































Sponsored Technology Links

 
 
 HOME  NEWS  BLOGS  PODCASTS  VIDEOS  TECHNOLOGIES  TEST CENTER  EVENTS  CAREERS   About | Advertise | Awards | RSS | Contact Us 

Copyright © 2008, Reprints, Permissions, Licensing, IDG Network, Privacy Policy, Terms of Service.
All Rights reserved. InfoWorld is a leading publisher of technology information and product reviews on topics including viruses,
phishing, worms, firewalls, security, servers, storage, networking, wireless, databases, and web services.

CIO :: ComputerWorld :: CSO :: Demo :: GamePro :: Games.net :: IDG Connect :: IDG World Expo
Industry Standard :: IT World :: JavaWorld :: LinuxWorld :: MacUser :: Macworld :: Network World :: PC World :: Playlist