![]() |
libdl
0.0.1
Simple yet powerful deep learning
|
Implements WordPiece tokenizaton as proposed in [9] using the optimized algorithm by [7] . More...
#include <wordpiece.hpp>
Classes | |
| struct | Conf |
Public Types | |
| using | PieceIter = utils::GenericIterator< std::tuple< std::string, std::size_t > > |
Public Member Functions | |
| std::vector< size_t > | tokenize (const std::string &text) const noexcept |
Static Public Member Functions | |
| static WordPieceTokenizer | fromConf (std::istream &stream) noexcept |
Implements WordPiece tokenizaton as proposed in [9] using the optimized algorithm by [7] .
Definition at line 28 of file wordpiece.hpp.
| using dl::WordPieceTokenizer::PieceIter = utils::GenericIterator<std::tuple<std::string, std::size_t> > |
Definition at line 30 of file wordpiece.hpp.