Thursday, February 4, 2010

How to Prepare Content - Part 2

This is Part 2 in a series. You may want to first read: Part 1.

The DingLabs Reader is intended to be open to anyone to post and share their own content.

The process isn't fully documented yet, but it is usable right now, and the steps are pretty simple:
  1. The first step is to use Transcriber to create a .trs file for each chapter (or audio file) you would like to share.
  2. Upload the .trs and .mp3 files somewhere on the Internet. If you don't have a website, you might just try Google Sites to host your files.
  3. Finally, create and upload an XML file which will describe your book and the locations for each .trs and .mp3 file.  As an example, here is a link to the actual XML file that is used for Alice in Wonderland:
        http://content.dinglabs.com/book/alice
    (you might have to "View Source" or "Save Link as..." to see the XML).
Now you can run the DingLabs Reader, and tell it to load your book. You do this by including a link to your XML file in the Reader URL. Here's an example, using Alice in Wonderland again. Just use your URL in the highlighted section (although it may appear as two lines, it is one long URL):
   http://reader.dinglabs.com/Book.html#b:=http://content.dinglabs.com/book/alice

That's it! When the program loads, it will fetch your XML file, and find the .trs and .mp3 file for each chapter, so it can create the interactive display for playback.

Feel free to reply with your questions or feedback.

Tuesday, February 2, 2010

How to Prepare Content - Part 1

To prepare content to be played back with the DingLabs Reader, you can use the free, open-source tool, Transcriber.




You can use Transcriber to manually synchronize an audio file and it's transcript. The result will be a .trs file, containing the transcript text and timestamps. The DingLabs Reader simply parses this .trs file to directly produce the interactive display of the transcript.

How to Align A Book Chapter

You will need:
  • Audio file for the chapter (WAV highly recommended)
  • Full text of the chapter
  • Transcriber (downloaded and installed)
I highly recommend only using WAV files with Transcriber, to guarantee accurate alignments. Transcriber doesn't seem to generate accurate timestamps with longer MP3 files.  So, first convert your MP3 files to WAV.  You can Google for such a tool, or maybe try an online converter.

When running Transcriber for the first time, make UTF-8 the default encoding:
  1. Click 'Options' > 'General...'
  2. Select 'Unicode UTF-8' for the Encoding, and click OK
  3. Click 'Options' > 'Save configuration'
Now you can bring in the audio and text to align. Click 'File' > 'New trans' and select the audio file for your chapter. Next, in the text area, you can paste in the full text of your chapter. If you need to change the font (to display Chinese, etc.), click 'Options' > 'Fonts' > 'Text' and choose a new font Family and size.

Now with your audio loaded, and your text pasted in, you can begin the work of synchronizing the text with the audio. As the audio plays back, you hit "ENTER" to synchronize the text at that position with that point of the audio.

Some keyboard shortcuts to use:
  • TAB key to play or pause the audio playback
  • ENTER to split the text, creating a sync point with the audio.
  • ALT-LEFT or ALT-RIGHT to take a small jump forward or backward in the audio
Tips:
  • If you hit ENTER by mistake, creating an unwanted split, just hit CTRL-BACKSPACE to remove the split.
  • If you didn't get the timing quite right on your sync point, you can easily adjust it with your mouse. Look at the lower-most green bar -- you can drag the section divider into the correct position with your mouse. Just hold down the CTRL key while clicking and dragging with your mouse.
  • To denote paragraphs, hit CTRL-T at the beginning of a paragraph, and create a speaker named "<p>".  To mark future paragraphs, you should be able to just hit CTRL-T then ENTER.
After using the program for a little bit, you'll get faster and faster at doing the alignments.

At any time you can hit CTRL-S to save your .trs file. You may want to give it a name similar to the audio file, to help keep things organized.

OK, have fun aligning, it can become addicting! :-)

Continue on to Part 2, and learn how you can share your work with the world by making it playable in the DingLabs Reader!

Edited 3/28/10 to recommend only using WAV files with Transcriber.

Friday, January 1, 2010

New Reader Demonstration Video



A quick demonstration video that talks about what the Reader is, and shows it in action.

Saturday, December 19, 2009

My Little Contribution to Language Learning

dinglabs.com Reader screenshot
I’ve been working on a “secret” project that may help people learn languages. It provides a way to listen to audio while reading along with the corresponding text. As the audio plays, the corresponding section of text is highlighted.  You can also click on any section of the text to immediately begin playing the audio from that point.
My dream is that all language learning podcasts will race to make their content available in this way.
WHY?
Listening and reading are critical components of language learning. But like oil and water, they’re not easy to combine, so each is generally practiced in isolation. Try listening to an audiobook while reading along with the real book.  What if you want to hear the last sentence again?  It’s a pain to rewind the audio.  What if you want jump to another part of the book?  What if you just listen for a while, but then need to refer to the book again later?  It’s frustrating because the audio and the book are not connected in any way.
But what if you could combine the two?
Peanut-butter jelly time! Embedded here is an example of a very short Chinese lesson (thanks ChinesePod). Try clicking the play button, or anywhere on the transcript:


I have scaled this up to entire books! Click here to see the full book, Alice in Wonderland, in all it’s English glory:
dinglabs.com Reader screenshot
(text from Project Gutenberg, audio from LibriVox)

Benefits
  • Instantly accessible from any web browser -- no special software to download/install.
  • You can still listen without reading, or read without listening. When you want to listen to a given part, or read a given part, you don’t have to struggle to find the right place in both the audio and text.
  • Since you’re reading along in a web-page, your favorite pop-up dictionary will continue to serve you!

The intent is that anyone can create their own synchronized text/audio to share with the world.  The player is just a tool that can play back an audio file with a given transcript. Anyone can host audio and transcripts on their own website, and use the player to present them together. The player can also be embedded in a webpage, as demonstrated with the Chinese lesson above.
Why would any language learning podcast NOT make their content available in this fashion?

What’s Next
  • Connect with other people who are interested in seeing this type of idea come alive.
  • Make existing content available in this format (like synchronizing content from LibriVox,  Project Gutenberg).
  • Gather feedback from users to see what kind of crazy things become possible when you have text and audio aligned (e.g. easily generating flashcards with text and audio).
I welcome your feedback!  And feel free to notify your favorite podcast if you’d like to see their content available in this type of format.

Thursday, December 17, 2009

"Reader" - Free Learning Tool Now Online

 



The DingLabs Reader combines an audio player with an interactive transcript. The transcript is highlighted as the audio plays. You can also click on the transcript to begin playing that section of audio!

The result is a powerful language learning tool. It is also a powerful tool for learning to read, and promoting literacy -- in any language.

The best part is, it is an open platform.  Anyone can publish their audios and transcripts for playback in the DingLabs Reader. Stay tuned for more on that...