<eml:eml xmlns:eml="https://eml.ecoinformatics.org/eml-2.2.0"
         xmlns:dc="http://purl.org/dc/terms/"
         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
         xsi:schemaLocation="https://eml.ecoinformatics.org/eml-2.2.0 https://rs.gbif.org/schema/eml-gbif-profile/1.3/eml.xsd"
         packageId="863890c7-c5ce-4fd2-ad32-c3bdf510c2b2/v1.1" system="http://gbif.org" scope="system"
         xml:lang="eng">
    <dataset>
        <alternateIdentifier>863890c7-c5ce-4fd2-ad32-c3bdf510c2b2</alternateIdentifier>
        <alternateIdentifier>https://www.verspreidingsatlas.nl/ipt/resource?r=flora-batava</alternateIdentifier>
        <shortName>Flora Batava</shortName>
        <title xml:lang="eng">Flora Batava (1800-1934): From Historical Citizen Science to Plant Humanities Dataset</title>
        <creator>
            <individualName>
                <givenName>Laurens</givenName>
                <surName>Sparrius</surName>
            </individualName>
            <organizationName>FLORON Plant Conservation Netherlands</organizationName>
            <address>
                <city>Nijmegen</city>
                <country>NL</country>
            </address>
            <userId directory="https://orcid.org/">0000-0002-4925-9289</userId>
        </creator>
        <metadataProvider>
            <individualName>
                <givenName>Luiza</givenName>
                <surName>Teixeira-Costa</surName>
            </individualName>
            <organizationName>Royal Netherlands Academy of Arts &amp; Sciences (KNAW)</organizationName>
            <address>
                <city>Amsterdam</city>
                <country>NL</country>
            </address>
            <userId directory="https://orcid.org/">0000-0002-1405-8567</userId>
        </metadataProvider>
        <associatedParty>
            <individualName>
                <givenName>Esther </givenName>
                <surName>van Gelder</surName>
            </individualName>
            <organizationName>KB nationale bibliotheek</organizationName>
            <address>
                <city>Den Haag</city>
                <country>NL</country>
            </address>
            <userId directory="https://orcid.org/">0000-0002-4505-6302</userId>
            <role>custodianSteward</role>
        </associatedParty>
        <associatedParty>
            <individualName>
                <givenName>Folgert</givenName>
                <surName>Karsdorp</surName>
            </individualName>
            <organizationName>Royal Netherlands Academy of Arts &amp; Sciences (KNAW)</organizationName>
            <address>
                <city>Amsterdam</city>
                <country>NL</country>
            </address>
            <userId directory="https://orcid.org/">0000-0002-5958-0551</userId>
            <role>principalInvestigator</role>
        </associatedParty>
        <pubDate>
            2026-04-12
        </pubDate>
        <language>dut</language>
        <abstract>
            Flora Batava: people, plants, locations lists 11,500+ records of all species in the first illustrated flora of the Netherlands, published in 28 volumes between 1800 and 1934. The dataset includes information about the plants, the people who observed them in each locality, and the publication of each volume. KB, the National Library of the Netherlands holds both original and digitized source material. From the latter, data was segmented and extracted using a generative AI model (OpenAI’s GPT-4), then checked and corrected manually. Including social (e.g., observers’ names, sex) and historical information (e.g., old plant names, publication history), this dataset facilitates research in plant humanities, botanical heritage, and social history of science.
        </abstract>
        <keywordSet>
            <keyword>Occurrence</keyword>
            <keywordThesaurus>GBIF Dataset Type Vocabulary: http://rs.gbif.org/vocabulary/gbif/dataset_type_2015-07-10.xml</keywordThesaurus>
        </keywordSet>
        <keywordSet>
            <keyword>Observation</keyword>
            <keywordThesaurus>GBIF Dataset Subtype Vocabulary: http://rs.gbif.org/vocabulary/gbif/dataset_subtype.xml</keywordThesaurus>
        </keywordSet>
        <intellectualRights>
            <para>This work is licensed under a <ulink url="http://creativecommons.org/licenses/by/4.0/legalcode"><citetitle>Creative Commons Attribution (CC-BY 4.0) License</citetitle></ulink>.</para>
        </intellectualRights>
        <licensed>
            <licenseName>Creative Commons Attribution 4.0 International</licenseName>
            <url>https://spdx.org/licenses/CC-BY-4.0.html</url>
            <identifier>CC-BY-4.0</identifier>
        </licensed>
        <distribution scope="document">
            <online>
                <url function="information">https://doi.org/10.5334/johd.497</url>
            </online>
        </distribution>
        <distribution scope="document">
            <online>
                <url function="download">https://www.verspreidingsatlas.nl/ipt/archive.do?r=flora-batava</url>
            </online>
        </distribution>
        <coverage>
            <geographicCoverage>
                <geographicDescription>The Netherlands and surroudings</geographicDescription>
                <boundingCoordinates>
                    <westBoundingCoordinate>1.978</westBoundingCoordinate>
                    <eastBoundingCoordinate>8.438</eastBoundingCoordinate>
                    <northBoundingCoordinate>53.645</northBoundingCoordinate>
                    <southBoundingCoordinate>49.325</southBoundingCoordinate>
                </boundingCoordinates>
            </geographicCoverage>
            <temporalCoverage>
                <rangeOfDates>
                    <beginDate>
                        <calendarDate>1790-01-01</calendarDate>
                    </beginDate>
                    <endDate>
                        <calendarDate>1934-12-31</calendarDate>
                    </endDate>
                </rangeOfDates>
            </temporalCoverage>
        </coverage>
        <maintenance>
            <description>
                <para></para>
            </description>
            <maintenanceUpdateFrequency>notPlanned</maintenanceUpdateFrequency>
        </maintenance>
        <contact>
            <individualName>
                <givenName>Laurens</givenName>
                <surName>Sparrius</surName>
            </individualName>
            <organizationName>FLORON Plant Conservation Netherlands</organizationName>
            <address>
                <city>Nijmegen</city>
                <country>NL</country>
            </address>
            <userId directory="https://orcid.org/">0000-0002-4925-9289</userId>
        </contact>
        <methods>
            <methodStep>
                <description>
                    <para>For a detailed description see https://openhumanitiesdata.metajnl.com/articles/10.5334/johd.497</para>
                </description>
            </methodStep>
            <sampling>
                <studyExtent>
                    <description>
                        <para>Observations of plants and fungi published in the Flora Batava journal (1800–1934).</para>
                    </description>
                </studyExtent>
                <samplingDescription>
                    <para>Scans of the journal&apos;s pages were processed with Optical Character Recognition and Handwritten Text Recognition. Text segmentation was used to classify paragraphs with labels such as “species names”, “flowering time”, “classification”, “sexual characteristics”, “species traits”, “habitat”, “medicinal use”, “domestic use”. Observations were extracted from the habitat sections using Generative AI. Geocoding for locality names was done with Nominatim (R) and Generative AI. Data was enriched by combining data from the national checklists and a database with historical observers of flora and fauna.</para>
                </samplingDescription>
            </sampling>
            <qualityControl>
                <description>
                    <para>After data extraction, all entries were individually checked against the source material and manually corrected if needed regarding spellings and correctness of the information. Entries were again manually checked during geocoding and data enrichment.</para>
                </description>
            </qualityControl>
        </methods>
        <project>
            <title>Flora Batava</title>
            <personnel>
                <individualName>
                    <givenName>Luiza </givenName>
                    <surName>Teixeira-Costa</surName>
                </individualName>
                <userId directory="https://orcid.org/">0000-0002-1405-8567</userId>
                <role>principalInvestigator</role>
            </personnel>
            <personnel>
                <individualName>
                    <givenName>Esther</givenName>
                    <surName>van Gelder</surName>
                </individualName>
                <userId directory="https://orcid.org/">0000-0002-4505-6302</userId>
                <role>custodianSteward</role>
            </personnel>
            <personnel>
                <individualName>
                    <givenName>Laurens </givenName>
                    <surName>Sparrius</surName>
                </individualName>
                <userId directory="https://orcid.org/">0000-0002-4925-9289</userId>
                <role>contentProvider</role>
            </personnel>
            <personnel>
                <individualName>
                    <givenName>Folgert</givenName>
                    <surName>Karsdorp</surName>
                </individualName>
                <userId directory="https://orcid.org/">0000-0002-5958-0551</userId>
                <role>principalInvestigator</role>
            </personnel>
        </project>
    </dataset>
    <additionalMetadata>
        <metadata>
            <gbif>
                <dateStamp>2026-04-10T14:46:00.647+02:00</dateStamp>
                <hierarchyLevel>dataset</hierarchyLevel>
                <citation>Teixeira-Costa L, van Gelder E, Sparrius L, Karsdorp, F  (2026). Flora Batava (1800-1934): From Historical Citizen Science to Plant Humanities Dataset. Version 1.1. FLORON Plant Conservation Netherlands. Occurrence dataset. https://www.verspreidingsatlas.nl/ipt/resource?r=flora-batava&amp;amp;v=1.1</citation>
                <bibliography>
                    <citation identifier="https://doi.org/10.5334/johd.497">Teixeira-Costa L, van Gelder E, Sparrius LB, Karsdorp, F (2026). Flora Batava (1800-1934): From Historical Citizen Science to Plant Humanities Dataset. Journal of Open Humanities Data 12: 4.</citation>
                </bibliography>
                <resourceLogoUrl>https://www.verspreidingsatlas.nl/ipt/logo.do?r=flora-batava</resourceLogoUrl>
                <dc:replaces>863890c7-c5ce-4fd2-ad32-c3bdf510c2b2/v1.1.xml</dc:replaces>
            </gbif>
        </metadata>
    </additionalMetadata>
</eml:eml>
