music21.corpus.manager¶
The manager module handles requests across multiple corpora. It should be the default interface to searching corpora.
New in v3 – previously most were static methods on corpus.corpora.Corpus, but that seemed inappropriate since these work across corpora.
Functions¶
- music21.corpus.manager.cacheMetadataBundleFromDisk(corpusObject: Corpus) None ¶
Update a corpus’ metadata bundle from its stored JSON file on disk.
- music21.corpus.manager.fromName(name)¶
Instantiate a specific corpus based on name:
>>> corpus.manager.fromName('core') <music21.corpus.corpora.CoreCorpus>
>>> corpus.manager.fromName('local') <music21.corpus.corpora.LocalCorpus: 'local'>
>>> corpus.manager.fromName(None) <music21.corpus.corpora.LocalCorpus: 'local'>
Note that this corpus probably does not exist on disk, but it’s ready to have paths added to it and to be stored on disk.
>>> corpus.manager.fromName('testDummy') <music21.corpus.corpora.LocalCorpus: 'testDummy'>
- music21.corpus.manager.getMetadataBundleByCorpus(corpusObject: corpora.Corpus) bundles.MetadataBundle ¶
Return the metadata bundle for a single Corpus object
>>> cc = corpus.corpora.CoreCorpus() >>> mdb1 = corpus.manager.getMetadataBundleByCorpus(cc) >>> mdb1 <music21.metadata.bundles.MetadataBundle 'core': {... entries}>
This is the same as calling metadataBundle on the corpus itself, but this is the routine that actually does the work. In other words, it’s the call on the object that is redundant, not this routine.
>>> mdb1 is cc.metadataBundle True
Non-existent corpus:
>>> lc = corpus.corpora.LocalCorpus('junk') >>> mdb1 = corpus.manager.getMetadataBundleByCorpus(lc) >>> mdb1 <music21.metadata.bundles.MetadataBundle 'junk': {0 entries}>
- music21.corpus.manager.getWork(workName: str | Path, movementNumber: int | None = None, *, fileExtensions: Iterable[str] = ()) Path | list[pathlib.Path] ¶
this parse function is called from corpus.parse() and does nothing differently from it.
Searches all corpora for a file that matches the name and returns it parsed.
- music21.corpus.manager.iterateCorpora(returnObjects=True)¶
a generator that iterates over the corpora (either as objects or as names) for use in pan corpus searching.
This test will only show the first two, because it needs to run the same on every system:
>>> for i, corpusObject in enumerate(corpus.manager.iterateCorpora()): ... print(corpusObject) ... if i == 1: ... break <music21.corpus.corpora.CoreCorpus> <music21.corpus.corpora.LocalCorpus: 'local'>
We can also get names instead. (Note that the name of the main local corpus is “local” not None)
>>> for i, corpusName in enumerate(corpus.manager.iterateCorpora(returnObjects=False)): ... print(corpusName) ... if i == 1: ... break core local
New in v3.
- music21.corpus.manager.listLocalCorporaNames(skipNone=False)¶
List the names of all user-defined local corpora.
The entry for None refers to the default local corpus.
- music21.corpus.manager.listSearchFields()¶
List all available search field names:
>>> for field in corpus.manager.listSearchFields(): ... field ... 'abstract' 'accessRights' 'accompanyingMaterialWriter' ... 'composer' 'composerAlias' 'composerCorporate' 'conceptor' 'conductor' ... 'dateCreated' 'dateFirstPublished' 'dateIssued' 'dateModified' 'dateSubmitted' 'dateValid' ... 'tempoFirst' 'tempos' 'textLanguage' 'textOriginalLanguage' 'timeSignatureFirst' 'timeSignatures' 'title' ...
- music21.corpus.manager.parse(workName: str | pathlib.Path, *, movementNumber: int | None = None, number: int | None = None, fileExtensions: Iterable[str] = (), forceSource: bool = False, format: str | None = None) stream.Score | stream.Part | stream.Opus ¶
- music21.corpus.manager.readAllMetadataBundlesFromDisk()¶
Read each corpus’s metadata bundle and store it in memory.
- music21.corpus.manager.search(query: str | None = None, field: str | None = None, *, corpusNames=None, fileExtensions: Iterable[str] | str = (), **keywords)¶
Search all stored metadata bundles and return a list of file paths.
This function uses stored metadata and thus, on first usage, will incur a performance penalty during metadata loading.
>>> corpus.search('china') <music21.metadata.bundles.MetadataBundle {1235 entries}>
>>> corpus.search('china', fileExtensions=('.mid',)) <music21.metadata.bundles.MetadataBundle {0 entries}>
>>> corpus.search('bach', field='composer') <music21.metadata.bundles.MetadataBundle {363 entries}>
Note the importance of good metadata – there’s almost 400 pieces by Bach in the corpus, but many do not have correct metadata entries.
This can also be specified as:
>>> corpus.search(composer='bach') <music21.metadata.bundles.MetadataBundle {363 entries}>
Or, to get all the chorales (without using corpus.chorales.Iterator):
>>> corpus.search(sourcePath='bach', numberOfParts=4) <music21.metadata.bundles.MetadataBundle {368 entries}>
This function is implemented in corpus.manager as a method there but also directly available in the corpus module for ease of use.
The
corpusNames
parameter can be used to specify which corpora to search, for example:>>> corpus.manager.search( ... 'bach', ... corpusNames=('core',), ... ) <music21.metadata.bundles.MetadataBundle {564 entries}>
>>> corpus.manager.search( ... 'bach', ... corpusNames=('core',), ... fileExtensions=('xml',), ... ) <music21.metadata.bundles.MetadataBundle {412 entries}>
If
corpusNames
is None, all corpora known to music21 will be searched.See usersGuide (chapter 11) for more information on searching