Archive for the 'Internet Standards' Category

“Official” statement about status of PAF premature

David April 14th, 2008

Genealogy expert, DearMYRTLE, raised a concern about my previous quote regarding PAF.  She did some research into whether my sources were considered "official" or not.  Contacting Gordon Clarke, LDS Family History Department, directly she found that Paul Nauta was the only person who can make "official" statements such as this. 

I’ll concede that this statement on the future of PAF is a bit premature for the general public.  Nevertheless, for software engineers who will be using the New FamilySearch API or the PAF API to create plugins for PAF, this statement from the Family History department makes some good suggestions for how new products can complement the technology offerings which the LDS Church has made and will be making to the community.  The Ancestry Insider has offered some additional insights into specific areas of this message.

Another huge Google milestone for independent developers

David April 8th, 2008

There are now thousands of instances where developers and businesses have used Google Maps APIs to create maps integrated with their own data (also called mashups).  Where geographic data used to be limited to specialized developers and organizations with a large budget, this ability is now in the hands of high school students with pet projects. 

Last night Google launched a preview release of Google App Engine, "a way for developers to run their web applications on Google’s infrastructure."  This promises to be equally enabling but not limited to a specific function like mapping tools.  Any generic application which requires an application server, web server, and database can be powered by this infrastructure.

With Google App Engine, developers can write web applications based on the same building blocks that Google uses, like GFS and Bigtable. Google App Engine packages those building blocks and provides access to scalable infrastructure that we hope will make it easier for developers to scale their applications automatically as they grow. This means they can spend less time dealing with system administration and maintenance, and more time building and improving their applications. (There’s more detail on the new App Engine Blog.)

…This preview of Google App Engine is available for the first 10,000 developers who sign up; we will increase that number in near future. So, developers, please sign up, download the SDK, and start your engines.

Here is a geeky step by step video for using Google App Engine with Python:

Using open standards lets the community redesign your site ;-)

David March 25th, 2008

LibraryThing designA scary thought… letting anonymous users on the internet redesign the look of your web site. But the folks over at LibraryThing are doing that very thing. Well, almost. They are letting user submit proposed designs so that they can be reviewed by the internal development staff.

So far, some two dozen members have contributed CSS stylesheets and one, zanix, produced a highly original design, executed entirely in Photoshop. MarkBarnes, acting on a suggestion from Abby, produced a very attractive design, based on the design of Cork’d, “LibraryThing for wine.” All told, there have been some really interesting ideas, and fetching new color palates. I’m still not sure where to take the design, but it’s given me a lot to think about. (It’s certainly pointed out some structural problems with our mark-up too.)

I don’t know how many active users are over there, but they have collectively submitted over 25 million books to their collective virtual book shelves. This goes to show that when you have built the right kind of community for your site, you can get all kinds of feedback and contribution back to the community.

So what does this mean for you?  If you take the time to design your web site from the ground up using standards based HTML and CSS (even if you just have a blog site), it will pay off many times over when you need to make minor changes to your design.  All of the popular PHP open source projects have taken this to heart, making it easy to create themes associated with their products.  (For example, there are literally hundreds of WordPress themes available to download for free use or purchase.)

LDS Church archives to become the Google Books of genealogical records

David March 21st, 2008

FamilySearch logoLast night I attended the Utah Java Users Group (UJUG) meeting and heard a presentation by senior developers and leadership from the LDS Church FamilySearch Digital Pipeline teams. I believe that it is certainly worth mentioning.

To give you a little background…

If you haven’t had a chance to see how the Google Books indexing project is coming along, take a look. They are taking scanners into university libraries across the US and scanning and indexing the full text. Not all of this is searchable online because of copyright issues but nevertheless huge number of books are now available because the copyright is out of date or because the publisher has granted Google rights to make them available.

The LDS Church has been scanning historical documents since the 1930s onto microfilm and microfiche and stored them at the Granite Mountain Vault for safekeeping. Now they are digitizing these scanned records as well as digitizing other records as they become available.

FamilySearch Indexing

With the newest scanning technology, they anticipate being able to completely scan all documents in the vault in 8 to 10 years. With terabytes of digitized images of censuses, birth / marriage / death certificates, and other records, the next step is to index this data. The technology for automatic indexing of handwritten documents is still not ready for production use but when you have an army of 130,000+ volunteers, you can utilize the strengths of technology to present the necessary information quickly and use the strengths of individuals to identify handwritten text. Doing so, they have been able to index up to 500,000 names per day. This includes double entry (two separate extractors) and arbitration if the data doesn’t match perfectly.

The Granite Mountain Vault isn’t the only digital data that is being processed by this program. Several US states have donated their records to the Family History Department to scan and preserve their data. The LDS Church is under negotiations with other public entities to extend the records that will be available. If you want to participate in this program, go to http://familysearchindexing.org. Don’t worry, they won’t run out of work for you to do any time soon!

FamilySearch Record Search

Although this isn’t open for public beta yet (summer 2008) all of the records that these volunteers are indexing are already available at http://search.labs.familysearch.org. They have developed Rich Internet Applications (RIA) utilizing a RESTful Web Services framework running on Java and open source technologies. They are building a highly scalable, parallel architecture to handle 100 requests per second (currently handling 80/sec). The presenter, Rob Edwards, said that in early negotiations with a 3rd party development company who supports eBay Japan (their architecture handles 3 requests per second) walked away from negotiations because they didn’t believe that their technology would handle so many more transactions.

They didn’t say this, but in a competitive work environment I’m sure that many companies are trying to recruit developers from these teams because they have been able to solve problems top companies are currently facing. Did I say that they are hiring?

Building Religious Mashups

David March 19th, 2008

Much the same way that various site have used GoogleMaps to create their own web content, the LDS Church has released an API to the scriptures so that this data can be redistributed and utilized for other applications. I’m excited to see more and more of these kinds of services being offered by the Church.

There is a lot of interest in the community to offer scriptures services. Here are a few that I have found:

Prototype of mobile scriptures: http://cameronmoll.com/sandbox/mobile/v2.php
Another version of scriptures, with social networking features: http://www.scripturesapp.com
BYU project to cross reference General Conference talks with the scriptures: http://scripturalindex.byu.edu/
Index for Jesus the Christ (Talmage) with the scriptures: http://www.connorboyack.com/blog/jesus-the-christ-index
Wiki-like project for the Bible in multiple languages (including Hebrew): http://biblos.com/
List of open source projects utilizing the scriptures: http://ldsoss.org/index.php/Religious_Study
Mormon Documentation project: http://mdp.nephi.org/

Linking to LDS.org audio files

David March 15th, 2008

If you have interest in either downloading MP3 files of scriptures or talks, you might be interested in the standard the the Church is using for their web site (from ldsWebguy).

This post describes the file naming convention for audio, video, PDF, PDA, and other files provided on Church Web sites for download. (Note: This post replaces my previous post MP3 File Naming Strategy.)