Bug #438

Feature #403: Perseus Book Importer

Importer fails on works with no biblStruct

Added by Luke Murphey over 11 years ago. Updated over 11 years ago.

Status: Closed
Priority: High
Assignee:
Target version:
Start date:
Due date:
% Done: 100%


Description

The Perseus importer fails on works that have no biblStruct:

reader.importer.PerseusBatchImporter: Exception generated when attempting to process file="hh_gk.xml" 
Traceback (most recent call last):
  File "/Users/lmurphey/Documents/SP/Workspace/TextCritical.com/src/reader/importer/PerseusBatchImporter.py", line 326, in process_directory
    if self.__process_file__( os.path.join( root, f) ):
  File "/Users/lmurphey/Documents/SP/Workspace/TextCritical.com/src/reader/importer/PerseusBatchImporter.py", line 277, in __process_file__
... 5 lines omitted ...
  File "/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/django/db/transaction.py", line 209, in inner
    return func(*args, **kwargs)
  File "/Users/lmurphey/Documents/SP/Workspace/TextCritical.com/src/reader/importer/Perseus.py", line 536, in import_xml_document
    bibl_struct_node = tei_header.getElementsByTagName("biblStruct")[0]
IndexError: list index out of range
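
The IndexError above is what indexing `[0]` produces when `getElementsByTagName` finds no matching element and returns an empty list. As a minimal sketch of one way to guard against that (assuming the header is parsed with `xml.dom.minidom`; the helper `first_element_or_none` and the sample header are hypothetical, and this is not necessarily the change made in the associated revisions):

```python
from xml.dom import minidom


def first_element_or_none(parent, tag_name):
    """Return the first descendant element named tag_name, or None if absent.

    Hypothetical helper for illustration; not part of the importer.
    """
    nodes = parent.getElementsByTagName(tag_name)
    return nodes[0] if len(nodes) > 0 else None


# Hypothetical, minimal TEI header that has no biblStruct element.
doc = minidom.parseString(
    "<teiHeader><fileDesc><titleStmt><title>Hymns</title></titleStmt>"
    "</fileDesc></teiHeader>"
)
tei_header = doc.documentElement

bibl_struct_node = first_element_or_none(tei_header, "biblStruct")
if bibl_struct_node is None:
    # No biblStruct: skip the biblStruct-specific metadata rather than
    # letting an IndexError abort the import of the whole work.
    pass
```

With a guard like this, a work without a biblStruct can still be imported, only without whatever metadata would have been read from that element.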

The following works cannot be imported due to this issue:

1    hh_gk.xml
2    aristot.vir_gk.xml
3    aristot.nic.eth_gk.xml
4    aristot.ath.pol_gk.xml
5    apollod_gk.xml

Associated revisions

Revision 152 (diff)
Added by Luke Murphey over 11 years ago

Fixed issue with importer where it fails to import works that do not have a biblStruct object. Reference #438.

Revision 146 (diff)
Added by Luke Murphey over 11 years ago

Fixed issue with importer where it fails to import works that do not have a biblStruct object. Reference #438.

History

#1 Updated by Luke Murphey over 11 years ago

  • Parent task set to #403

#2 Updated by Luke Murphey over 11 years ago

  • Description updated (diff)

#3 Updated by Luke Murphey over 11 years ago

  • Status changed from New to In Progress

#4 Updated by Luke Murphey over 11 years ago

  • Status changed from In Progress to Closed
  • % Done changed from 0 to 100
