由於我們的NAS Bench-2012已擴展到NATS板凳,因此該倉庫被棄用且未維護。請使用NATS板凳,該板凳的體系結構信息多於NAS-Bench-2010。
我們提出了一個具有固定搜索空間的算法 - 敏捷的NAS基準(NAS Bench-2012),該搜索空間幾乎為任何最新的NAS算法提供了統一的基準測試。我們的搜索空間的設計靈感來自最受歡迎的基於單元格的搜索算法,該算法將單元格表示為有向的無環圖。這裡的每個邊緣都與從預定義的操作集中選擇的操作相關聯。為了適用於所有NAS算法,NAS-Bench-201中定義的搜索空間包括4個節點和5個相關的操作選項,總共生成15,625個神經細胞候選者。
在此Markdown文件中,我們提供:
對於以下兩件事,請使用Autodl-Project:
注意:請使用PyTorch >= 1.2.0
和Python >= 3.6.0
。
您只需鍵入pip install nas-bench-201
即可安裝我們的API。請參閱此存儲庫中nas-bench-201
模塊的源代碼。
如果您有任何疑問或問題,請在此處發布或給我發電子郵件。
[棄用]可以從Google Drive或Baidu-Wangpan(代碼:6U5D)下載NAS-Bench-201的舊基准文件。
[推薦] NAS-Bench-201的最新基准文件( NAS-Bench-201-v1_1-096897.pth
)可以從Google Drive下載。模型重量的文件太大(431G),我需要一些時間上傳。請耐心等待,感謝您的理解。
您可以將其移動到所需的任何地方,並將其路徑發送到我們的API以進行初始化。
NAS-Bench-201-v1_0-e61699.pth
(2.2G),其中e61699
是該文件的最後六位數字。它包含除每個試驗訓練的權重外的所有信息。NAS-Bench-201-v1_1-096897.pth
(4.7G),其中096897
是該文件的最後六位數字。與NAS-Bench-201-v1_0-e61699.pth
相比,它包含更多試驗的信息,尤其是所有在所有數據集中受過12個時期訓練的模型都是可避免的。我們建議使用NAS-Bench-201-v1_1-096897.pth
可以從Google Drive或Baidu-Wangpan下載NAS Bench-201中使用的培訓和評估數據(代碼:4FG7)。建議將這些數據放入$TORCH_HOME
( ~/.torch/
默認情況下)。如果您想親自生成NAS BENCH-2010或類似的NAS數據集或培訓模型,則需要這些數據。
在我們的測試代碼中可以找到更多用法。
from nas_201_api import NASBench201API as API
api = API('$path_to_meta_nas_bench_file')
# Create an API without the verbose log
api = API('NAS-Bench-201-v1_1-096897.pth', verbose=False)
# The default path for benchmark file is '{:}/{:}'.format(os.environ['TORCH_HOME'], 'NAS-Bench-201-v1_1-096897.pth')
api = API(None)
len(api)
和每個體系結構api[i]
的架構數量: num = len(api)
for i, arch_str in enumerate(api):
print ('{:5d}/{:5d} : {:}'.format(i, len(api), arch_str))
# show all information for a specific architecture
api.show(1)
api.show(2)
# show the mean loss and accuracy of an architecture
info = api.query_meta_info_by_index(1) # This is an instance of `ArchResults`
res_metrics = info.get_metrics('cifar10', 'train') # This is a dict with metric names as keys
cost_metrics = info.get_comput_costs('cifar100') # This is a dict with metric names as keys, e.g., flops, params, latency
# get the detailed information
results = api.query_by_index(1, 'cifar100') # a dict of all trials for 1st net on cifar100, where the key is the seed
print ('There are {:} trials for this architecture [{:}] on cifar100'.format(len(results), api[1]))
for seed, result in results.items():
print ('Latency : {:}'.format(result.get_latency()))
print ('Train Info : {:}'.format(result.get_train()))
print ('Valid Info : {:}'.format(result.get_eval('x-valid')))
print ('Test Info : {:}'.format(result.get_eval('x-test')))
# for the metric after a specific epoch
print ('Train Info [10-th epoch] : {:}'.format(result.get_train(10)))
index = api.query_index_by_arch('|nor_conv_3x3~0|+|nor_conv_3x3~0|avg_pool_3x3~1|+|skip_connect~0|nor_conv_3x3~1|skip_connect~2|')
api.show(index)
此字符串|nor_conv_3x3~0|+|nor_conv_3x3~0|avg_pool_3x3~1|+|skip_connect~0|nor_conv_3x3~1|skip_connect~2|
方法:
node-0: the input tensor
node-1: conv-3x3( node-0 )
node-2: conv-3x3( node-0 ) + avg-pool-3x3( node-1 )
node-3: skip-connect( node-0 ) + conv-3x3( node-1 ) + skip-connect( node-2 )
config = api.get_net_config(123, 'cifar10') # obtain the network configuration for the 123-th architecture on the CIFAR-10 dataset
from models import get_cell_based_tiny_net # this module is in AutoDL-Projects/lib/models
network = get_cell_based_tiny_net(config) # create the network from configurration
print(network) # show the structure of this architecture
如果要加載此創建的網絡的訓練有素的權重,則需要使用api.get_net_param(123, ...)
獲取權重,然後將其加載到網絡中。
api.get_more_info(...)
可以在培訓 /驗證 /測試集上返回損失 /準確性 /時間,這非常有幫助。有關更多詳細信息,請查看get_more_info函數中的註釋。
有關其他用法,請參閱lib/nas_201_api/api.py
。我們在相應功能的評論中提供了一些用法信息。如果未提供您想要的內容,請隨時開放討論問題,我很樂意回答有關NAS Bench-201的任何問題。
在nas_201_api
中,我們定義了三個類: NASBench201API
, ArchResults
, ResultsCount
。
ResultsCount
維護特定試驗的所有信息。一個人可以實例化結果並通過以下代碼獲取信息( 000157-FULL.pth
節省了157-架構的所有試驗的所有信息):
from nas_201_api import ResultsCount
xdata = torch.load('000157-FULL.pth')
odata = xdata['full']['all_results'][('cifar10-valid', 777)]
result = ResultsCount.create_from_state_dict( odata )
print(result) # print it
print(result.get_train()) # print the final training loss/accuracy/[optional:time-cost-of-a-training-epoch]
print(result.get_train(11)) # print the training info of the 11-th epoch
print(result.get_eval('x-valid')) # print the final evaluation info on the validation set
print(result.get_eval('x-valid', 11)) # print the info on the validation set of the 11-th epoch
print(result.get_latency()) # print the evaluation latency [in batch]
result.get_net_param() # the trained parameters of this trial
arch_config = result.get_config(CellStructure.str2structure) # create the network with params
net_config = dict2config(arch_config, None)
network = get_cell_based_tiny_net(net_config)
network.load_state_dict(result.get_net_param())
ArchResults
保留了建築所有試驗的所有信息。請參閱以下用法:
from nas_201_api import ArchResults
xdata = torch.load('000157-FULL.pth')
archRes = ArchResults.create_from_state_dict(xdata['less']) # load trials trained with 12 epochs
archRes = ArchResults.create_from_state_dict(xdata['full']) # load trials trained with 200 epochs
print(archRes.arch_idx_str()) # print the index of this architecture
print(archRes.get_dataset_names()) # print the supported training data
print(archRes.get_compute_costs('cifar10-valid')) # print all computational info when training on cifar10-valid
print(archRes.get_metrics('cifar10-valid', 'x-valid', None, False)) # print the average loss/accuracy/time on all trials
print(archRes.get_metrics('cifar10-valid', 'x-valid', None, True)) # print loss/accuracy/time of a randomly selected trial
NASBench201API
是最高的API。請參閱以下用法:
from nas_201_api import NASBench201API as API
api = API('NAS-Bench-201-v1_1-096897.pth') # This will load all the information of NAS-Bench-201 except the trained weights
api = API('{:}/{:}'.format(os.environ['TORCH_HOME'], 'NAS-Bench-201-v1_1-096897.pth')) # The same as the above line while I usually save NAS-Bench-201-v1_1-096897.pth in ~/.torch/.
api.show(-1) # show info of all architectures
api.reload('{:}/{:}'.format(os.environ['TORCH_HOME'], 'NAS-BENCH-201-4-v1.0-archive'), 3) # This code will reload the information 3-th architecture with the trained weights
weights = api.get_net_param(3, 'cifar10', None) # Obtaining the weights of all trials for the 3-th architecture on cifar10. It will returns a dict, where the key is the seed and the value is the trained weights.
要獲取培訓和評估信息(請參閱此處的評論):
api.get_more_info(112, 'cifar10', None, hp='200', is_random=True)
# Query info of last training epoch for 112-th architecture
# using 200-epoch-hyper-parameter and randomly select a trial.
api.get_more_info(112, 'ImageNet16-120', None, hp='200', is_random=True)
如果您發現NAS-Bench-201有助於您的研究,請考慮引用它:
@inproceedings{dong2020nasbench201,
title = {NAS-Bench-201: Extending the Scope of Reproducible Neural Architecture Search},
author = {Dong, Xuanyi and Yang, Yi},
booktitle = {International Conference on Learning Representations (ICLR)},
url = {https://openreview.net/forum?id=HJxyZkBKDr},
year = {2020}
}