Metadata-Version: 1.1
Name: coquery
Version: 0.9.2
Summary: Coquery: A free corpus query tool
Home-page: http://www.coquery.org
Author: Gero Kunter
Author-email: gero.kunter@coquery.org
License: GPL3
Description: Coquery - a free corpus query tool
        ==================================
        
        Coquery is a free corpus query tool for linguists, lexicographers,
        translators, and anybody who wishes to search and analyse text corpora.
        It is available for Windows, Linux, and Mac OS X computers.
        
        You can either build your own corpus from a collection of text files (either 
        PDF, MS Word, OpenDocument, HTML, or plain text) in a directory on your 
        computer, or install a corpus module for one of the supported corpora (the 
        corpus data files are not provided by Coquery).
        
        .. figure:: http://www.coquery.org/_images/showcase.png
           :alt: Coquery: Main interface
        
           Coquery: Main interface
        
        Tutorials and documentation can be found on the Coquery website:
        http://www.coquery.org
        
        Features
        --------
        
        An incomplete list of the things you can do with Coquery:
        
        Corpora
        ~~~~~~~
        
        -  Use the corpus manager to install one of the supported corpora
        -  Build your own corpus from PDF, HTML, .docx, .odt, or plain text files
        -  Filter your query for example by year, genre, or speaker gender
        -  Choose which corpus features will be included in your query results
        -  View every token that matches your query within its context
        
        Queries
        ~~~~~~~
        
        -  Match tokens by orthography, phonetic transcription, lemma, or gloss,
           and restrict your query by part-of-speech
        -  Use string functions e.g. to test if a token contains a letter
           sequence
        -  Use the same query syntax for all installed corpora
        -  Automate queries by reading them from an input file
        
        Analysis
        ~~~~~~~~
        
        -  Summarize the query results as frequency tables or contingency tables
        -  Calculate entropies and relative frequencies
        -  Fetch collocations, and calculate association statistics like mutual
           information scores or conditional probabilities
        
        Visualizations
        ~~~~~~~~~~~~~~
        
        -  Use bar charts, heat maps, or bubble charts to visualize frequency
           distributions
        -  Illustrate diachronic changes by using time series plots
        -  Show the distribution of tokens within a corpus in a barcode or a
           beeswarm plot
        
        Databases
        ~~~~~~~~~
        
        -  Either connect to easy-to-use internal databases, or to powerful
           MySQL servers
        -  Access large databases on a MySQL server over the network
        -  Create links between tables from different corpora, e.g. to provide
           phonetic transcriptions for tokens in an unannotated corpus
        
        Supported corpora
        -----------------
        
        Coquery already has installers for the following linguistic corpora and
        lexical databases:
        
        -  `British National Corpus (BNC)`_
        -  `Buckeye Corpus`_
        -  `CELEX Lexical Database (English)`_
        -  `Carnegie Mellon Pronunciation Dictionary (CMUdict)`_
        -  `Corpus of Contemporary American English (COCA)`_
        -  `Corpus of Historical American English (COHA)`_
        -  `Ġabra: an open lexicon for Maltese`_
        -  `ICE-Nigeria`_
        -  `Switchboard-1 Telephone Speech Corpus`_
        
        .. _British National Corpus (BNC): http://www.natcorp.ox.ac.uk/
        .. _Buckeye Corpus: http://buckeyecorpus.osu.edu/
        .. _CELEX Lexical Database (English): https://catalog.ldc.upenn.edu/LDC96L14
        .. _Carnegie Mellon Pronunciation Dictionary (CMUdict): http://www.speech.cs.cmu.edu/cgi-bin/cmudict
        .. _Corpus of Contemporary American English (COCA): http://corpus.byu.edu/coca/
        .. _Corpus of Historical American English (COHA): http://corpus.byu.edu/coha/
        .. _`Ġabra: an open lexicon for Maltese`: http://mlrs.research.um.edu.mt/resources/gabra/
        .. _ICE-Nigeria: http://sourceforge.net/projects/ice-nigeria/
        .. _Switchboard-1 Telephone Speech Corpus: https://catalog.ldc.upenn.edu/LDC97S62
        
        If the list is missing a corpus that you want to see supported in Coquery, 
        you can either write your own corpus installer in Python using the installer 
        API, or you can [contact](http://www.coquery.org/contact) the Coquery 
        maintainers and ask them for assistance.
        
        License
        -------
        
        Copyright (c) 2016 Gero Kunter
        
        Initial development was supported by:
            English Linguistics
            Institut für Amerikanistik und Amerikanistik
            Heinrich-Heine-Universität Düsseldorf
            
        Coquery is free software released under the terms of the 
        `GNU General Public license (version 3) <http://www.coquery.org/license>`_
Keywords: corpus linguistics query corpora analysis visualization
Platform: UNKNOWN
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: GNU General Public License v3 (GPLv3)
Classifier: Operating System :: MacOS
Classifier: Operating System :: Microsoft :: Windows
Classifier: Operating System :: POSIX
Classifier: Programming Language :: Python :: 2
Classifier: Programming Language :: Python :: 2.7
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.3
Classifier: Programming Language :: Python :: 3.4
Classifier: Topic :: Education
Classifier: Topic :: Scientific/Engineering
Classifier: Topic :: Scientific/Engineering :: Information Analysis
Classifier: Topic :: Scientific/Engineering :: Visualization
Classifier: Topic :: Text Processing :: Linguistic
