DigAcq Home

Ordering and Set Up

Access Problem Solving (digprob)

Managing Vera Records

Licensing


Cataloging

NERD






MIT Libraries

Directory of Open Access Journals (DOAJ) Import to Vera


A. Downloading and processing the file from the DOAJ

1. In your web browser, go to http://www.doaj.org/doaj2csv.cgi. You'll get a dialog box titled "Downloading doaj" that will ask you what your browser should do with the file. Save it to your Vera Import folder.

The file from the DOAJ doesn't contain all the information we need for the Vera records, nor does it contain all the information in the correct place. Rich Wenger wrote a script that will fix these things for us. The script to make these changes is called ej_format.pl, and resides on Athena at:

/afs/athena.mit.edu/dept/libraries/staff/systems/scripts/openaccess

2. In SecureFX, connect to Athena using your DOAJ Script profile. This will connect you to the path listed just above.

3. In the left hand pane, rename the file doaj2.csv (add a period between the 2 and the c).

4. Copy the file doaj2.csv from your local directory to the Athena directory.

5. In SecureCRT at an Athena prompt, type cd /afs/athena.mit.edu/dept/libraries/staff/systems/scripts/openaccess to get to that directory (that's cd, space, and then the path name)

6. Type pwd to make sure you're actually in the right place.

7. To run the script, type the following exactly as it is written below:

./ej_format.pl --infile=doaj2.csv --outfile=doaj-done.csv

8. After the script finishes running, return to SecureFX. Make sure to Refresh Views under the View menu, then FTP the finished file called "doaj-done.csv" back down to your computer into your Vera import folder.

B. Post script-running process

  1. Launch Excel, and open a new blank document.
  2. In the Data menu, choose "Import External Data" then "Import Data File...". Navigate to the file on your computer and then choose Open.
  3. When confronted with the Import Wizard, you'll have to make some choices. Your answers are
    1. Step 1: delimited (then click Next)
    2. Step 2: comma (uncheck Tab, then click Next)
    3. Step 3: general (then click Finish)
  4. Where do you want to put it? Existing worksheet.
  5. Now you can eyeball it to see if it looks like the data is in the correct columns.
  6. Save the document as an Excel file. (.xls). Go to File --> Save As... and
    • Save In: your Vera import folder
    • File name: doaj-done.xls
    • Save as type: Microsoft Excel Workbook (*.xls)
  7. You now have a file called "doaj-done.xls." This is the file you'll import into Vera.

C. Deleting the set imported into Vera last time

Before you import the current set, you have to delete the old set.

1. In Vera, find the set of records you imported from DOAJ last time by doing a Find in the Mark Set field for the text "imported from DOAJ."

2. Once you're sure you have the correct set of records, go to Records --> Delete Found Records.

D. Importing the DOAJ file into Vera

  1. In Vera, go to File --> Import records
  2. Navigate to the file on your computer "doaj-done.xls" (if it asks, it's Sheet1)
  3. Check the box "Don't import first record (Contains field names)."
  4. Match the field names on the left (how the columns are labelled in done.xls) with the field names on the right (what they are called in Vera). If you choose View By: Matching Names, it should work.
  5. Make sure you have the arrow pointing from left to right (-->) between each of the field names above.
  6. Make sure you uncheck everything else, so that there is a little zero (kind of looks like a dot) for everything else (i.e., not a --> and not a <-->)
  7. Check to make sure your choices make sense by "scanning" the data.
  8. Choose "Add new records" radio button
  9. Click Import button
  10. Check the box that asks if you want to "perform auto-enter options." (You do want to perform auto-enter options; this is how the unique Vera ID is created, how the creation date gets in there, the Creator Name, etc.)
  11. Click OK.
  12. Note the number of records imported and email to Kim so it can be added to the History of records imported in the table at the bottom of the page. There is significant duplication between the DOAJ and J-STAGE journal lists. In order to avoid duplicating the open-access records that come in the J-STAGE load, do a find with "DOAJ" in the Mark set field, and "http://www.jstage" in the URL_Native field. Note the number of records returned. The delete the returned set with "Delete fournd records." Subtract the number of J-STAGE records deleted from the total original number of DOAJ records imported, and e-mail this figure to Kim so it can be added to the DOAJ import history.

E. Global Replaces for the Set

Mark Set

  1. Add the text "imported from DOAJ, <today's date>" to the Mark Set field of the first record in the set.
  2. Click in the field so that your cursor is in the field, then go to Records --> Replace...
  3. It will ask if you want to replace the contents of the Mark Set field to whatever you just typed in the [number of] records in the found set. You have to actually choose Replace by clicking on it (Enter will cancel the process.)

History of records loaded into Vera

Records should be loaded into Vera every two months.

Date
Records
5/13/2004
1,086
11/12/2004
1,357
3/21/2005
1,485
5/20/2005
1,562
7/15/2005
1,642
9/22/2005
1,706 (does not include the 71 records that are also part of JSTAGE and which we deleted from this DOAJ load)
11/21/2005
1,840 (does not include the 84 records that are also part of JSTAGE and which we deleted from this DOAJ load)
12/2/2005
1,865 (does not include the 85 records that are also part of JSTAGE and which we deleted from this DOAJ load)
1/20/2006 1,922 (does not include the 85 records that are also part of JSTAGE and which we deleted from this DOAJ load)
6/1/2006 2,045 (does not include the 173 records that are also part of JSTAGE and which we deleted from this DOAJ load)
7/27/2006 2,229 (does not include the 92 records that are also part of JSTAGE and which we deleted from this DOAJ load)
9/29/2006 2,315 (does not include the 86 records that are also part of JSTAGE and which we deleted from this DOAJ load)
11/17/2006 2,381 (does not include the 86 records that are also part of JSTAGE and which we deleted from this DOAJ load)
6/1/2007 2,633 (does not include the 86 records that are also part of JSTAGE and which we deleted from this DOAJ load)
8/10/2007 2,718 (does not include the 85 records that are also part of JSTAGE and which we deleted from this DOAJ load)
10/6/2007 2,773 (does not include the 85 records that are also part of JSTAGE and which we deleted from this DOAJ load)
12/21/2007 2,946 (does not include the 83 records that are also part of JSTAGE and which we deleted from this DOAJ load)
2/15/2008 3,104 (does not include the 81 records that are also part of JSTAGE and which we deleted from this DOAJ load)
3/29/2008 3,222 (JSTAGE was down so couldn't get that file)
5/2/2008 3,258 (does not include the 82 records that are also part of JSTAGE and which we deleted from this DOAJ load)
11/6/2008
3726 total titles, minus 73 JSTAGE titles = 3653 net DOAJ titles
2/24/2009 3788 titles (3861 gross, minus 73 JSTAGE titles)
5/7/09 4045 (4124 gross, minus 71 JSTAGE titles)
10/9/09 4280 (4360 gross, minue 70 JSTAGE titles)
Written by Kim Maxwell; Last updated by Kim Maxwell, October 21, 2009