Code4lib Day 1: Lightning Talks

Cynthia Ng  – RULA Bookfinder

Julien Gibert – Turning a Solr Response into a RDF file

  • Theses.fr
  • Sorry, this went by me, plus I was busy running back to my seat

Bill Dueber – Datamart Report Generator at UMich

  • actually talking about spreadsheets
  • want to support data-drive decision making, but it’s boring, and canned reports tend not to do it
  • can end up in substring hell
  • solution: build data warehouse
  • took Aleph oracle COBOL store, removed insanity and put it in another oracle database
  • funds and inventory reports now possible
  • running 20-25 reports a week
  • more than when we ran it by hand, and saves lots of time

Jonathan Rochkind – bento_search

  • RubyRails gem
  • external search services e.g. Google books
  • federated e.g. primo, eds, ebscohost, scopus, worldcat, google books
  • can use whatever you want, just need to add it
  • can customize to have link resolver
  • github.com/jrochkind/sample_megasearch/
  • much more functionality

Masao Takaku – saveMLAK project for two years

  • came out of the effort to save museum, library, archive, kominkan (community centre) after the big earthquake
  • gather information of facilities in damaged area using a wiki
  • coordinate activities to rebuild
  • efforts are still continuing

Jon Stroop – Loris Image Server

  • define syntax for image access
  • can specify width/height, part of image, quality
  • Talk link

Ross Singer – How are you managing copyright?

  • lazy attempt at crowd-sourced business development
  • copyright is complicated
  • there are standard licenses, but then there are a lot of exclusions and exceptions
  • still, roughly the same model
  • management already being done in some capacity by the universities
  • but in US/Canada there is fair dealing and fair use
  • Slides

Eric Nord – Candybars for Bugs

  • Harold B. Lee Library
  • worked on maps in library
  • pop up map
  • will give candy bar if found error
  • only had to give away 18
  • have a ‘report a problem’ with this item
  • builds the idea to power the patron

Megan O’Neill Kudzia – Games for Pedagogy in the Library

  • working with faculty
  • a lot of interest, but no opportunity to talk about it
  • purchasing games on an ask basis
  • working out how to make accessible, in catalogue
  • licensing issues for PC/console games

Geoffrey Boushey – GEDI Reference App for InterLibrary Loan

  • General Electronic Document Interchange (ISO Standard)
  • used by Ariel
  • headers added to a file when sent from one institution to another
  • basis for making an easy to use tool so different ILL systems can communicate with each other
  • on Github

George Campbell – three.js: 3D Objects in the browser

  • used to have to use flash or flip through images
  • can now use interactive 3D graphics
  • can scale, add text/images, move

John Sarnowski – Audio Archiving with Full Text Search

  • ResCarta Toolkit
  • display and play audio
  • add metadata
  • use conversion tool
  • embeds into XML portion
  • final file can then be searched
  • words can be highlighted just like a text file

That’s the end of Day 1! Join us tomorrow. Time for a nap.

sleeping panda cubs

Code4lib Day 1: Lightning Talks Notes

Al Cornish – XTF in 300 seconds (Slides in PDF)

  • technology developed and maintained by California Digital Library
  • supports the search/display of digital collections (images, PDFs, etc)
  • fully open source platform, based on Apache Lucene search toolkit
  • Java framework, runs in Tomcat or Jetty servlet engine
  • extensive customization possible through XSLT programming
  • user and developer group communication through Google Groups
  • search interface running on Solr with facets
  • can output in RSS
  • has a debug mode

Makoto Okamoto – saveMLAK (English)

  • Aid activities for the Great East Japan Earthquake through collaboration via wiki
  • input from museum, library, archive, kominkan = MLAK
  • 20,000 data of damaged area
  • Information about places, damages, and relief support
  • Key Lessons
    • build synergy with twitter
    • have offline meet ups & training

Andrew Nagy – Vendors Suck

  • vendors aren’t really that bad
  • used to think vendors suck, and that they don’t know how to solve libraries’ problems
  • but working for a vendor allows to make a greater impact on higher education, more so than from one university (he started to work for SerialsSolution)
  • libraries’ problems aren’t really that unique
  • together with the vendor, a difference can be made
  • call your vendors and talk to the product managers
  • if they blow you off, you’ve selected the wrong vendor
  • sometimes vendor solutions can provide a better fit

Andreas Orphanides – Heat maps

The library needed grad students to teach instructional sessions, but how to set schedule when classes have a very inflexible schedule? So, he used the data of 2 semesters of instructional sessions using date and start time, but there were inconsistent start times and duration. The question is how best to visualize the data.

  • heatmap package from clickheat
  • time of day – x-dimension
  • day of the week – y-dimension
  • could see patterns in way that you can’t in histogram or bar graph
  • heat map needn’t be spatial
  • heat maps can compare histogram-like data along a single dimension or scatter-like plot data to look for high density areas

Gabriel Farrell – ElasticSearch

Nettie Lagace from NISO

  • National Information Standards Organization (NISO)
  • work internationally
  • want to know: What environment or conditions are needed to identify and solve the problem of interoperability problems?

Eric Larson – Finding images in book page images

A lot of free books exist out there, but you can’t have the time to read them all. What if you just wanted to look at the images? Because a lot of books have great images.

He used curl to pull all those images out, then use imagemagick to manage the images. The processing steps:

  1. Convert to greyscale
  2. Contrast boost x8
  3. Covert image to 1px by height
  4. Sharpen image
  5. Heavy-handed grayscaling
  6. Convert to text
  7. Look for long continuous line of black to pull pages with images

Code is on github

Adam Wead – Blacklight at the Rock Hall

  • went live, soft launch about a month ago
  • broken down to the item level
  • find bugs he doesn’t know about for a beer!

Kelley McGrath – Finding Movies with FRBR & Facets

  • users are looking for movies, either particular movie or genre/topic
  • libraries describe publications e.g. date by DVD, not by movie
  • users care about versions e.g. Blu-Ray, language
  • Try the prototyped catalog
  • Hit list provides one result per movie, can filter by different facets

Bohyun Kim – Web Usability in terms of words

  • don’t over rely on the context
  • but context is still necessary for understanding e.g. “mobile” – means on the go, what they want on the go
  • sometimes there is no better term e.g. “Interlibrary Loan”
  • brevity will cost you “tour” vs. “online tour”
  • Time ran out, but check out the rest of the slides

Simon Spero – Restriction Classes, Bitches

OWL:

  • lets you define properties
  • control what the property can apply to
  • control the values the property can take
  • provides an easy way to do this
  • provides a really confusing way to do this

The easy way is usually wrong!

When defining what can apply to and the range, this applies to every use of the property. An alternative is Attempto.

Cynthia Ng – Processing & ProcessingJS

  • Processing: open source visual programming language
  • Processing.js: related project to make processing available through web browsers without plugins
  • While both tend to focus on data visualizations, digital art, and (in the case of PJS) games, there are educational oriented applications.
  • Examples:
    • Kanji Compositing – allows visual breakdown of Japanese kanji characters, interact with parts, and see children.
    • Primer on Bezier Curves – scroll down to see interactive (i.e. if you move points, replots on the fly) and animated graphs.
  • Obvious use might be instructional materials, but how might we apply it in this context? What other applications might we think of in the information organization world?

Since doing the presentation, I have already gotten one response by Dan Chudnov who did a quick re-rendering of newspaper data from OCR data. Still thinking on (best) use in libraries and other information organizations.

It’s over for today, but if you’d like more, do remember that there is a livestream and you can follow on twitter, #c4l12 or IRC.