This module allows to import taxonomies, structures or simple lists of terms into a vocabulary from a CSV file, a url or a copy-and-paste text.
Csv format is a simple list of values separated by a delimiter, often comma (,
) or semi-colon (;
), and enclosures, often quotation marks ("
). If you are unsure how to create a CSV file, you might want to use OpenOffice Calc or another spreadsheet application to export your data into a CSV file.
To import terms, user need one permission:
Four items need to be set in order to import your terms.
Source can be configured with the first field set. The first column always contains the term name. You can specify what you want to import and how additional columns should be imported.
Vocabulary structure should be imported first when multiple files are used. So choose options flat, structure, first level of parents or children before relations, descriptions, etc.
Terms are imported as simple terms. Additional columns are ignored. Example:
Diamond
Gold
Wood
Use this option to import a lot of terms in order to create a flat vocabulary. All items in your file will be imported as terms. Example:
Clothes, Trees, Houses
Paris, Drupal
Use this option to create a tree structure of a vocabulary (geography, classification...). To import a hierarchy with multiple parents as a genealogical one, it's advised to use "Polyhierarchy", "First level children" or "First level parents" imports.
Your file can be written with two schemes.
In the first scheme, you need to set all ancestors to each term. The second column will be imported as the name of a child term of the term defined by the first column. The third column will be imported as a child of the second column, and so on. Lines can have any order. Example:
Animal, Mammal, Whale
Animal, Mammal, Monkey
Animal, Mammal, Human
Mammal, Human
Mammal
> is imported as a first level term and not as a < Animal
> term child as in previous line.
World
Asia
Japan
Tokyo
Korea
Seoul
World
,Asia
,,Japan
,,,Tokyo
,,Korea
,,,Seoul
,Europe
,,France
,,,Paris
,,Italia,Abruzzo,Chieti,Chieti
,,,,,Civitaluparella
Paris
> will be automatically added as a child of < France
> and so on.
Partial lines are allowed, so a next line can be:
,,Switzerland,Bern
Switzerland
> will be added as a child of < Europe
> and of course < Bern
> as a child of < Switzerland
>.
In same way, above lines can be simplified to:
World, Asia, Japan, Tokyo
,, Korea, Seoul
World, Europe, France, Paris
Seoul
> will become a child of < Japan
>.
Use this option to create a a polyhierarchical structure, as a genealogy.
Format is the same than tree structure: each term is the child of previous item: parent, child, sub-child... and so on.
There are four differences. First, the first item doesn't need to be a root. Second, duplicate terms are always merged, except when one is the direct parent of the other one, because it's forbidden in Drupal. So, if the vocabulary is monohierarchical and without non-direct duplicate terms, as in the previous geographical example, result is the same than with previous option. Third, lines can be partial too, but in some case of duplicates, result may differ. Last, polyhierarchy can be recursive.
For example, lines may be:
Grand-Mother, Mother, Daughter
Grand-Father, Mother, Son
Grand-Mother 2, Father, Daughter
Grand-Father 2, Father, Son
, , Son 2
, Uncle
Grand-Mother 2, Uncle
Father, Son 3
This choice allows to create a vocabulary in which terms can have more than one parent, as in a genealogy (polyhierarchy).
The first item is imported as a term, the second and next as first level parents of first term. For example, lines may be:
Daughter, Mother, Father
Son, Mother, Father
Son 2, Father
Son 3, Father
Mother, Grand-Mother, Grand-Father
Father, Grand-Mother 2, Grand-Father 2
Uncle, Grand-Mother 2, Grand-Father 2
This choice allows to create a vocabulary in which terms can have more than one parent, as in a genealogy (polyhierarchy).
The first item is imported as a term, the second and next as first level children of first term. For example, lines may be:
Mother, Daughter, Son
Father, Daughter, Son, Son 2, Son 3
Grand-Mother, Mother
Grand-Father, Mother
Grand-Mother 2, Father, Uncle
Grand-Father 2, Father, Uncle
Warning: in Drupal 7, related terms have been removed and replaced by fields. So this option is available only with Drupal 6. To get same functionnality, you need to create a "relations" field.
Use this option to create links between terms, as "see also" in an encyclopedia.
The second and next columns will be imported as related terms of the first column term. For example, a line may be:
Baobab, Madagascar, Ghost
Baobab
> will be related to < Madagascar
> and < Ghost
>. An option allow you to create subrelations, here between < Madagascar
> and < Ghost
> (see below).
The second column will be imported as the term description of the first column term. Example:
"Baobab","An African tree"
The second column will be imported as weight of the term defined by first column. Example:
Baobab, 3
The second and next columns will be imported as synonyms terms of the first column term. Example:
"United Kingdom","Great Britain","England"
This format is used by Taxonomy manager to export vocabularies. Columns are: vocabulary id, term id, term name, term description, first level parent 1, ..., first level parent n. Example:
"1"; "25"; "Louis XVII"; "King of France"; "Louis XVI"; "Marie-Antoinette"
You can import your terms from a file or with a text area. Simply check your choice. File can be a local file path or a url.
Advanced settings allow to set non standard delimiter and enclosure.
You can import your terms in a existing vocabulary or create a new one. You can import your terms too in a duplicate of an existing vocabulary.
When you want to import a new taxonomy into an existing one, it is recommended to process in three steps in order to allow a good import.
If you only want to create a new vocabulary, the first choice is sufficient, unless when you have multiple files for one vocabulary.
Destination can be configured with the second field set. You can specify what will become existing terms. Four choices are possible, or less matching your source content:
Update current terms when name matches with imported ones and merge existing descriptions, parents, synonyms and related terms with new ones. Duplicates are removed. This choice is recommended if you want to keep a vocabulary and if you have descriptions, parents, synonyms and related terms you don't want to lose.
Update current terms if name matches, but remove existing descriptions, parents, synonyms and related terms. Choose this option if you are want to create a clean new vocabulary without losing existing terms.
Be careful: if there is no description, parent, synonym or related term, i.e. the line contains only a term in the first column, this removes the existing.
Let current terms as they are and create a new term for the first column term.
Warning: This can create duplicate terms. It is recommended to use this option only if you are sure that imported taxonomy contains only new terms or if your vocabulary allows multiple parents.
Create a new term for each term on the line.
To disable internal cache allows to import vocabularies of any size. Internal cache is used to speed up process, to reduce access to sql base an to be informed about process. When disabled, no information about results can be displayed except eventual first error or warning.
As to calculate vocabulary hierarchy is memory intensive, this option allows to set hierarchy manually without verify it.
If you are sure that vocabulary to import is well formated (utf8, order of items...), you can disable checks. This option increases import speed by 5%.
,
" by default, semicolon ";
" or tabulation) between terms can be chosen in Advanced settings in the second fieldset. You can choose a custom delimiter too. Delimiter is a one character string. Example with delimiter < ¤
>:term 1¤This field has commas, a semicolon (;), a quotation mark (") and a tabulation, but it will be correctly imported.
"
), specialy if they contain non-ASCII letters or if imported items, in particular descriptions, contain the chosen delimiter. Example:"term 1","This field has a comma, but it will be correctly imported."
"
) are automatically managed.
Another Drupal module allows CSV import too, despite its name: taxonomy XML. Its approach is different: it uses one file complient to thesauri standard ISO 2788, i.e. a three columns csv file: first term, type of link, second term
, or, for specialists, subject, predicate, object
. For Drupal 4.7 and Drupal 5, taxonomy batch operations is available too. So choose the module best matching your needs.