Code4Lib 2014: Day 1 Morning

Morning talks of Day 1 at Code4lib 2014.

A Book, a Web Browser and a Tablet: How Bibliotheca Alexandrina’s Book Viewer Framework Makes it Possible – Engy Morsy

Book viewer that was developed inside the library.

Goals

multiple digitization projects for preserving knowledge in monographs and availing them to people around the world
readers miss features they like in old books
availability of ebook readers

Why not PDF

copyright – limited by license
have to be recreated every time the license changes on what is allowed
page selection
morphological search could not have been done otherwise
all in one (metadata, content and search) object
streaming/browser compatibility – have to wait until download the whole PDF, frequently have to install a plugin
personalization/annotation – can only be done offline

Book Viewer Framework

two main parts:
* client side
* server side
client side
* can highlight
* create sticky notes
* annotations
* authenticate and can save these
* different views e.g. thumbnails, my basket (for out of copyright works, can select pages and download own compilation)
* metadata can be viewed
* search results within the PDF with page numbers and snippets, search highlights
* view related books
* standard zoom functions but with functionality to download higher quality version once a threshold is reached
server side
* storage layer – mainly responsible for streaming images
* content serving layer will process images for client display
* using Solr 4.1, but can use any framework that you like

Future Work

include other type of media objects e.g. manuscripts
android implementation
enhance storage layer

Book site viewer

Quick and Easy Data Visualization with Google Visualization API and Google Chart Libraries – Bohyun Kim

Spreadsheets are not databases, and they’re not efficient, but database might not be convenient, might not have knowledge or resources in house.

Use spreadsheet as data source, query using visualization API, and charts to visualize.

Two Ways to Display Result
* get a stand alone page with table
* chart and table with JavaScript and Google Charts

Example

Really big spreadsheet.

All you need is a id, sheet id, columns you want.

You can use aggregate functions e.g. count, limit, avg, pivot

Use JS instead of a URL
* set query string within JavaScript call
* can drop it into your own page
* can also specify header, range

Visualize

load Chart libraries, list data, select options
visualize table, chart
chart.draw = longest and verbose, but most flexible with ability to draw multiple charts on one page, and requires to know which chart library that you want to use
chartwrapper = shorter, but only allows certain functions
drawchart – quickest, but can’t handle any events, and don’t need to load packages

Charts has limitation of up to 400,000 cells, 256 columns.

Use Own Database to load into Google Chart Libraries

need to contruct datatable from SQL database
encode into JSON
then use Google Chart

Controls

Can give control to your users using dashboard draw ability e.g. change age and gender

Final Notes

Take a look at the
* code playground.
* public data exchange

Slides, Code

WebSockets for Real-Time and Interactive Interfaces – Jason Ronallo

What just happened?

When change slides, presentation sends a message to Websocker server: node + socket.io

What Problem it’s Trying to Solve

browser is always making requests, start have to
browser always has to make request

Solution

Polling (AJAX), but some problems so came up with
Long Polling (Hanging GET), also tried but had the problem
queuing latency
server sent events, where after initial handshake, server can stream data to client, but client can’t communicate back
Websocket comes along to create consistent connection where data can flow in both directions, can even send data at the same time with full duplex in real time
gets around queuing latency.
payload can send UTF-8, binary, whatever

Why Not Use it For Everything?

lose XHR, HTTP services
e.g. cache
Want to use the right tool for the job

Websockets good for?

collaborative
real-time display of activity
remote control
e.g. Google Analytics Real-time and see digital collection whie viewing them
did tracking to show popularity this week or today, but only updated every hour

Vanity Metrics

show value in a way that makes sense, won’t change decisions on changing website, but show some interesting things
real time interaction with the website
using Rails + Blacklight Server to node + express + socket.io
can use it for multiple applications

Practical Use

immersion theatre to show big video walls
inspired by listen to wikipedia – visualization and audiation of wikipedia edits
starting thinking about simple remote controls
provides URL to interact with the wall
token-based authentication, but future might use geolocation, BLE, NEC
remote control interface is you can turn language on/off
goal: low latency, like pressing button on the wall
keep clients in sync by using websockets
also see content that’s being display, e.g. stream of articles being edited and pause

Unintended Consequences

deleted a large portion of an article just to see how big the bubble would be
vizualization wall – take control by choose control add content to wall, then be able to see and explore the actual content
chrome screen emulation – resolution wasn’t that high on the images, so no efficienty issues

Conclusion

Notetaker’s: This is really cool

> This is your moment of zen.

Personalizing Google Analytics Using Events and Custom Variables – Josh Wilson

Events and CUstom Variables

out of the box: GA gives you general data on audience (who), acquisition (how), behavior (what)
record personalized data about your sit’es usage
custom variables adds data about who’s doing the stuff, conceptually like a custom segment e.g. users not logged in, people with items in shopping cart
events = stuff that’s happening beyond basic pageview e.g. download, facet click, video play

Which to use?

depends on what you want to know: more about people or what happens
reports look like any in GA

Examples

Contentdm
* can work with universal analytics which you can enable website config tool
* defaults analytics provides a number of detailed events
* but also wanted to do custom example e.g. data on whether certain metadata popped up in views (state agency)
* custom GA script for use in contentdm
Drupal
* GA module has built-in fields to add custom variables
* can also add custom snippets
* using it to track

Implementation

Your CMS might do it for you as plugin

The code
* add one more line of code, and define how to label it
* only required is category and action, but can add label and other fields

Considerations
* extra JS, usually some sort of trigger code is required
* changing a category/action/label – surprise, it’s now a new event
* balance clarify with consistency
* test using HTTP watching tool e.g. HTTPFox (firefox plugin) or just wait a day
Gotchas
* timing is everything
* can’t fire event if GA isn’t loaded
* can’t record page elemtn value if it’s not loaded
* custom variables must be set before trackpageview
* tracking a download or link-click
* you have to check IE8
* multiple domains/subdomains requires extra switches

Code

Discovering your Discovery System in Real Time – Godmar Back and Annette Bailey

Observations in 2014
* majority of resources are electronic
* physical space mostly for student work areas and digital displays
* librarians = active partners in research
* libraries replacing OPACs with discovery systems
Risks associated with these changes
* use of library collections becomes invisible (more difficult to assess or track?)
* search engines compete with discovery systems for users requiring substantial marketing effort
How do we make the discovery process and use of resources visible?

Can We See What our Users are Doing?

COUNTER standard which gives us usage data
* usually after the fact
* unfortunately interpreted differently
discovery service provides data on search terms & click data, but aggregate and not real-time

What if we could see what our users are doing in real time while still affording them some privacy? What if we could share that with other members of the community?

Discovering Discovery

Wanted to visualize real-time items users discover as they click results.

Added some code to be able to see what is being accessed in different visualizations e.g. Twitter-style, cube, ticker style

How it Works

Use custom JS, record clicks (extract ID from HTML), save to visualization server with database, get record using Summon API, push to D3 based visualizations or Google Charts.

Conclusion

A way to clearly see what research is being used.

A Book, a Web Browser and a Tablet: How Bibliotheca Alexandrina’s Book Viewer Framework Makes it Possible – Engy Morsy

Goals

Why not PDF

Book Viewer Framework

Future Work

Quick and Easy Data Visualization with Google Visualization API and Google Chart Libraries – Bohyun Kim

Example

Visualize

Use Own Database to load into Google Chart Libraries

Controls

Final Notes

WebSockets for Real-Time and Interactive Interfaces – Jason Ronallo

What just happened?

What Problem it’s Trying to Solve

Solution

Why Not Use it For Everything?

Websockets good for?

Vanity Metrics

Practical Use

Unintended Consequences

Conclusion

Personalizing Google Analytics Using Events and Custom Variables – Josh Wilson

Events and CUstom Variables

Which to use?

Examples

Implementation

Discovering your Discovery System in Real Time – Godmar Back and Annette Bailey

Can We See What our Users are Doing?

Discovering Discovery

How it Works

Conclusion

Lunch Time

Share this: