The new data is now in the schema named 'mimir'. The 'minerva' schema will be renamed 'mimir_pre_1_2011'.
Updated Datasets
The following tables have have been updated to contain more recent data:
courtesy_appointments, dissertations, dissertation_members, pta/pta_roles, spo_projects/spo_project_segments/spo_roles, student_programs/students, and woods and biox (combined into new centers_membership table)
Note that we are still waiting for an update to the FSI tables. Also note that there will likely be an additional round of person_id cleanup in the next few weeks, focusing mainly on splitting incorrectly merged person entities (this is just cleanup though and will not changed most ids).
Person Entity Improvements
In the previous update, a less strict procedure for normalizing names and resolving person entities was followed, which resulted in a much larger number of person_ids than it should have. We've removed these person_ids and re-matched all those names - in the previous update, there were 70,281 unique names and 56,521 person_ids, whereas now there are 37,368 unique names and 28,955 person_ids.
The process for performing the name resolution has been improved, and additional tools have been added to correct cases where two person_ids need to be merged into one, and where a single person_id needs to be split into separate entities. However, this means that some person_ids have been reassigned or changed. Any table that involves person_ids should be regenerated or rechecked against the people table to make sure the person_ids still exist.
Also, records that contain bogus or empty person name fields have been assigned a person_id of 0. You likely want to exclude these from any queries you write.
Database Structure Changes
There have been a number of structural changes to the database in order to be more consistent and more reliable in terms of maintaining person_ids.