|
2024-02-26 22:10:44 | INFO | model_worker | args: Namespace(host='0.0.0.0', port=40004, worker_address='http://localhost:40004', controller_address='http://localhost:10000', model_path='MBZUAI/MobiLlama-08B', revision='main', device='cuda', gpus=None, num_gpus=1, max_gpu_memory=None, dtype=None, load_8bit=False, cpu_offloading=False, gptq_ckpt=None, gptq_wbits=16, gptq_groupsize=-1, gptq_act_order=False, awq_ckpt=None, awq_wbits=16, awq_groupsize=-1, enable_exllama=False, exllama_max_seq_len=4096, exllama_gpu_split=None, exllama_cache_8bit=False, enable_xft=False, xft_max_seq_len=4096, xft_dtype=None, model_names=None, conv_template=None, embed_in_truncate=False, limit_worker_concurrency=5, stream_interval=2, no_register=False, seed=None, debug=False, ssl=False) |
|
2024-02-26 22:10:44 | INFO | model_worker | Loading the model ['MobiLlama-08B'] on worker 7c759f2b ... |
|
2024-02-26 22:10:45 | ERROR | stderr |
Loading checkpoint shards: 0%| | 0/2 [00:00<?, ?it/s] |
|
2024-02-26 22:10:52 | ERROR | stderr |
Loading checkpoint shards: 50%|██████████████████████████████████████                                     | 1/2 [00:06<00:06, 6.60s/it] |
|
2024-02-26 22:10:59 | ERROR | stderr |
Loading checkpoint shards: 100%|███████████████████████████████████████████████████████████████████████████| 2/2 [00:13<00:00, 6.71s/it] |
|
2024-02-26 22:10:59 | ERROR | stderr |
Loading checkpoint shards: 100%|███████████████████████████████████████████████████████████████████████████| 2/2 [00:13<00:00, 6.69s/it] |
|
2024-02-26 22:10:59 | ERROR | stderr | |
|
2024-02-26 22:10:59 | INFO | model_worker | Register to controller |
|
2024-02-26 22:10:59 | ERROR | stderr | INFO: Started server process [459676] |
|
2024-02-26 22:10:59 | ERROR | stderr | INFO: Waiting for application startup. |
|
2024-02-26 22:10:59 | ERROR | stderr | INFO: Application startup complete. |
|
2024-02-26 22:10:59 | ERROR | stderr | INFO: Uvicorn running on http://0.0.0.0:40004 (Press CTRL+C to quit) |
|
2024-02-26 22:11:09 | INFO | stdout | INFO: 127.0.0.1:42738 - "POST /worker_get_status HTTP/1.1" 200 OK |
|
2024-02-26 22:11:44 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:12:29 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:13:14 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:14:00 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:14:41 | INFO | stdout | INFO: 127.0.0.1:33018 - "POST /worker_get_status HTTP/1.1" 200 OK |
|
2024-02-26 22:14:45 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:14:59 | INFO | stdout | INFO: 127.0.0.1:60758 - "POST /worker_get_status HTTP/1.1" 200 OK |
|
2024-02-26 22:15:30 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:16:15 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:17:00 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:17:45 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:17:57 | INFO | stdout | INFO: 127.0.0.1:42514 - "POST /worker_get_status HTTP/1.1" 200 OK |
|
2024-02-26 22:18:14 | INFO | stdout | INFO: 127.0.0.1:33310 - "POST /worker_get_status HTTP/1.1" 200 OK |
|
2024-02-26 22:18:30 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:19:15 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:20:00 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:20:45 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:21:30 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:22:15 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:23:00 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:23:45 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:24:30 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:25:15 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:26:00 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:26:45 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:27:30 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:28:15 | INFO | model_worker | Send heart beat. Models: ['MobiLlama-08B']. Semaphore: None. call_ct: 0. worker_id: 7c759f2b. |
|
2024-02-26 22:28:15 | ERROR | model_worker | heart beat error: HTTPConnectionPool(host='localhost', port=10000): Max retries exceeded with url: /receive_heart_beat (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f8bb17cc2b0>: Failed to establish a new connection: [Errno 111] Connection refused')) |
|
2024-02-26 22:28:20 | ERROR | model_worker | heart beat error: HTTPConnectionPool(host='localhost', port=10000): Max retries exceeded with url: /receive_heart_beat (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f8bb17cd8a0>: Failed to establish a new connection: [Errno 111] Connection refused')) |
|
2024-02-26 22:28:25 | ERROR | model_worker | heart beat error: HTTPConnectionPool(host='localhost', port=10000): Max retries exceeded with url: /receive_heart_beat (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f8bb17ce170>: Failed to establish a new connection: [Errno 111] Connection refused')) |
|
2024-02-26 22:28:30 | ERROR | model_worker | heart beat error: HTTPConnectionPool(host='localhost', port=10000): Max retries exceeded with url: /receive_heart_beat (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f8bb17cea40>: Failed to establish a new connection: [Errno 111] Connection refused')) |
|
2024-02-26 22:28:32 | ERROR | stderr | INFO: Shutting down |
|
2024-02-26 22:28:32 | ERROR | stderr | INFO: Waiting for application shutdown. |
|
2024-02-26 22:28:32 | ERROR | stderr | INFO: Application shutdown complete. |
|
2024-02-26 22:28:32 | ERROR | stderr | INFO: Finished server process [459676] |
|
|