2 changes: 1 addition & 1 deletion nemo_text_processing/text_normalization/normalize.py
@@ -737,7 +737,7 @@ def parse_args():
     parser.add_argument(
         "--language",
         help="language",
-        choices=["en", "de", "es", "fr", "hu", "sv", "zh", "ar", "it", "hy", "ja", "hi", "ko", "vi"],
+        choices=["en", "de", "es", "fr", "hu", "sv", "zh", "ar", "it", "hy", "ja", "hi", "ko", "vi", "pt"],
         default="en",
         type=str,
     )
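The effect of this one-line change can be sketched with a standalone parser. This is a minimal illustration, not the code in normalize.py: only the `--language` argument is reproduced, and `build_parser` is a hypothetical helper introduced here.

```python
import argparse

# Language codes copied from the updated `choices` list in the hunk above.
SUPPORTED_LANGUAGES = [
    "en", "de", "es", "fr", "hu", "sv", "zh", "ar", "it", "hy", "ja", "hi", "ko", "vi", "pt",
]


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical stand-in for parse_args(); only --language is reproduced."""
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--language",
        help="language",
        choices=SUPPORTED_LANGUAGES,
        default="en",
        type=str,
    )
    return parser


# With "pt" in choices, argparse accepts the value instead of exiting with an error.
args = build_parser().parse_args(["--language", "pt"])
print(args.language)
```

Before the change, passing `--language pt` would have been rejected by argparse because the value was not in `choices`.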
@@ -0,0 +1,13 @@
# Copyright (c) 2026, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
@@ -0,0 +1,7 @@
.com ponto com
.com.br ponto com ponto br
.gov.br ponto gov ponto br
.org ponto org
.net ponto net
.edu ponto edu
.br ponto br
@@ -0,0 +1,16 @@
google google
usuario usuario
microsoft microsoft
amazon amazon
facebook facebook
meta meta
netflix netflix
spotify spotify
samsung samsung
apple apple
linkedin linkedin
instagram instagram
whatsapp whatsapp
oracle oracle
adobe adobe
paypal paypal
@@ -0,0 +1,6 @@
gmail
nvidia
outlook
hotmail
yahoo
live
@@ -0,0 +1,22 @@
. ponto
- traço
_ underscore
! exclamação
# cerquilha
$ dólar
% por cento
& e comercial
' apóstrofo
* asterisco
+ mais
/ barra
= igual
? interrogação
^ acento circunflexo
` crase
{ chave esquerda
| barra vertical
} chave direita
~ til
, vírgula
: dois pontos
@@ -2,3 +2,4 @@ connector e
 minus menos
 plural_suffix s
 avos_suffix avos
+avos_between
@@ -0,0 +1,13 @@
# Copyright (c) 2026, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
@@ -0,0 +1,64 @@
h horas
min minutos
s segundos
ms milissegundos
ns nanossegundos
μs microssegundos
t toneladas
kg quilos
kg quilogramas
g gramas
mg miligramas
μm micrômetros
nm nanômetros
mm milímetros
cm centímetros
cm² centímetros quadrados
cm³ centímetros cúbicos
m metros
m² metros quadrados
m³ metros cúbicos
km quilômetros
km² quilômetros quadrados
ha hectares
kph quilômetros por hora
mph milhas por hora
m/s metros por segundo
l litros
ml mililitros
kgf quilogramas força
% por cento
°F fahrenheit
°F graus fahrenheit
°C graus celsius
Hz hertz
kHz quilo hertz
MHz mega hertz
GHz giga hertz
W watts
kW quilowatts
MW megawatts
GW gigawatts
Wh watts hora
kWh quilowatts hora
MWh megawatts hora
GWh gigawatts hora
kV quilovolts
V volts
mV milivolts
A amperes
mA miliamperes
rpm rotações por minuto
db decibéis
cal calorias
kcal quilocalorias
G gramas
KG quilos
KG quilogramas
KM quilômetros
M metros
L litros
ML mililitros
M2 metros quadrados
M^2 metros quadrados
C graus celsius
@@ -0,0 +1,65 @@
h hora
min minuto
s segundo
ms milissegundo
ns nanossegundo
μs microssegundo
t tonelada
kg quilo
kg quilograma
g grama
mg miligrama
μm micrômetro
nm nanômetro
mm milímetro
cm centímetro
cm² centímetro quadrado
cm³ centímetro cúbico
m metro
m² metro quadrado
m³ metro cúbico
km quilômetro
km² quilômetro quadrado
ha hectare
kph quilômetro por hora
mph milha por hora
m/s metro por segundo
l litro
ml mililitro
kgf quilograma força
% por cento
°F fahrenheit
°C celsius
°F grau fahrenheit
°C grau celsius
Hz hertz
kHz quilo hertz
MHz mega hertz
GHz giga hertz
W watt
kW quilowatt
MW megawatt
GW gigawatt
Wh watt hora
kWh quilowatt hora
MWh megawatt hora
GWh gigawatt hora
kV quilovolt
V volt
mV milivolt
A ampere
mA miliampere
rpm rotação por minuto
db decibel
cal caloria
kcal quilocaloria
G grama
KG quilo
KG quilograma
KM quilômetro
M metro
L litro
ML mililitro
M2 metro quadrado
M^2 metro quadrado
C celsius
13 changes: 13 additions & 0 deletions nemo_text_processing/text_normalization/pt/data/money/__init__.py
@@ -0,0 +1,13 @@
# Copyright (c) 2026, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
@@ -0,0 +1,5 @@
US$ dólar americano
R$ real
€ euro
£ libra esterlina
$ dólar
@@ -0,0 +1,5 @@
dólar americano dólares americanos
real reais
euro euros
libra esterlina libras esterlinas
dólar dólares
@@ -0,0 +1,5 @@
US$ centavo
R$ centavo
€ centavo
£ centavo
$ centavo
@@ -0,0 +1 @@
centavo centavos
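The four currency tables above (symbol to singular name, singular to plural, symbol to minor unit, minor-unit plural) suggest a two-step lookup. The following is a minimal sketch in plain Python for illustration only, not the PR's implementation. It assumes that only an amount of exactly 1 takes the singular form and that longer symbols such as `US$` must be matched before `$`; all function names are hypothetical.

```python
# Tables copied from the data files in this diff.
MAJOR_SINGULAR = {
    "US$": "dólar americano",
    "R$": "real",
    "€": "euro",
    "£": "libra esterlina",
    "$": "dólar",
}
MAJOR_PLURAL = {
    "dólar americano": "dólares americanos",
    "real": "reais",
    "euro": "euros",
    "libra esterlina": "libras esterlinas",
    "dólar": "dólares",
}
# Every symbol in the diff maps to the same minor unit.
MINOR_SINGULAR = {symbol: "centavo" for symbol in MAJOR_SINGULAR}
MINOR_PLURAL = {"centavo": "centavos"}


def split_symbol(text: str) -> tuple:
    """Split 'US$10' into ('US$', '10'), trying the longest symbols first."""
    for symbol in sorted(MAJOR_SINGULAR, key=len, reverse=True):
        if text.startswith(symbol):
            return symbol, text[len(symbol):].strip()
    raise ValueError(f"no known currency symbol in {text!r}")


def major_name(symbol: str, amount: int) -> str:
    """Return the spoken currency name for a whole-unit amount."""
    singular = MAJOR_SINGULAR[symbol]
    # Assumption: only an amount of exactly 1 takes the singular form.
    return singular if amount == 1 else MAJOR_PLURAL[singular]


def minor_name(symbol: str, amount: int) -> str:
    """Return the spoken minor-unit name for a cents amount."""
    singular = MINOR_SINGULAR[symbol]
    return singular if amount == 1 else MINOR_PLURAL[singular]


print(split_symbol("US$10"))
print(major_name("R$", 2))
print(minor_name("€", 1))
```

Trying the longest symbol first matters because `$` is a suffix of both `US$` and `R$`; a naive scan for `$` would misread those amounts as plain dollars.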
@@ -0,0 +1,13 @@
# Copyright (c) 2026, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
@@ -0,0 +1,2 @@
ip ip
endereço de ip endereço de i p
@@ -0,0 +1,4 @@
ligue para
telefone
celular
meu número é
@@ -1,4 +1,4 @@
-manhã da manhã
-tarde da tarde
-noite da noite
-madrugada da madrugada
+manhã da manhã 6 11
+tarde da tarde 12 17
+noite da noite 18 23
+madrugada da madrugada 0 5
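The new side of this hunk adds two numeric columns to each period. A minimal sketch of how such rows could be consumed, assuming the numbers are inclusive start and end hours on a 24-hour clock (the diff does not state this explicitly); `period_suffix` is a hypothetical helper, not a function from the PR.

```python
# Rows copied from the new side of the hunk:
# (period, spoken suffix, first hour, last hour).
PERIODS = [
    ("manhã", "da manhã", 6, 11),
    ("tarde", "da tarde", 12, 17),
    ("noite", "da noite", 18, 23),
    ("madrugada", "da madrugada", 0, 5),
]


def period_suffix(hour: int) -> str:
    """Return the spoken suffix for an hour, treating both bounds as inclusive."""
    if not 0 <= hour <= 23:
        raise ValueError(f"hour out of range: {hour}")
    for _period, suffix, start, end in PERIODS:
        if start <= hour <= end:
            return suffix
    # Not reachable with the rows above, which together cover hours 0 through 23.
    raise ValueError(f"no period covers hour {hour}")


print(period_suffix(15))
```

Under that reading, the four ranges partition the day without gaps or overlaps, so every valid hour resolves to exactly one suffix.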